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Abstract 

Networks in which the processing of jobs occurs both sequentially and in parallel are prevalent 
in many application domains, such as computer systems, healthcare, manufacturing, and project 
management. The parallel processing of jobs gives rise to synchronization constraints that can be 
a main reason for job delay. In comparison with feedforward queueing networks that have only 
sequential processing of jobs, the approximation and control of networks that have synchronization 
constraints is less understood. One well-known modeling framework in which synchronization 
constraints are prominent is the fork-join processing network. 

Our objective is to find scheduling rules for fork-join processing networks with multiple job 
types in which there is first a fork operation, then activities that can be performed in parallel, 
and then a join operation. The difficulty is that some of the activities that can be performed in 
parallel require a shared resource. We solve the scheduling problem for that shared server (that 
is, which type of job to prioritize each time the server becomes available) when that server is in 
heavy traffic and prove an asymptotic optimality result. 

Keywords: Fork-join processing network; Scheduling Control; Diffusion Approximation; Asymp¬ 
totic Optimality. 

AMS Classification: Primary 60K25, 90B22, 90B36, 93E20; Secondary 60F17, 60J70. 

1 Introduction 

Networks in which processing of jobs occurs both sequentially and in parallel are prevalent in 
many application domains, such as computer systems (Xia et al. [2007]), healthcare (Armony et al. 
[2015]), manufacturing (Dallery and Gershwin [1992]), project management (Adler et al. [1995]), and 
the justice system (Larson et al. [1993]). The parallel processing of jobs gives rise to synchronization 
constraints that can be a main reason for job delay. Although delays in fork-join networks can be 
approximated under the common hrst-come-first-served (FCFS) scheduling discipline (Nguyen [1993, 
1994]), there is no reason to believe FCFS scheduling minimizes delay. 

Our objective in this paper is to devise control policies that minimize delay (or, more generally, 
holding costs) in fork-join networks with multiple customer classes that share processing resources. 
For a concrete motivating example, consider the patient-flow process associated with the emergency 
department at Saintemarie University Hospital (cf. Hublet et al. [2011]). An arriving patient is first 
triaged to determine condition severity, and then (after some potential waiting) moves to the patient 
management phase before being discharged. The patient management phase begins with the vital 
signs being taken and a first evaluation. Then, depending on the condition, there may be laboratory 
tests and radiology exams. Simple laboratory tests on the patient’s blood and urine can be performed 
in parallel with the patient receiving a radiology exam, such as a CT scan. The discharge decision - 
whether the patient can return home or should be admitted to the hospital - cannot be made until 
all test results are received. Roughly speaking, we can imagine a process flow diagram such as that 
in Figure 1, where the patient type corresponds to the condition severity determined at triage, the 
isolated operations correspond to the laboratory tests (which are necessarily associated with each 
individual patient), and the shared operation corresponds to the use of the CT scanner. The CT 
scanner is an expensive machine, and, as can be seen from the case teaching note (cf. Hublet et al. 
[2011]), has a large impact on patient wait time. This motivates us to study the problem of how to 
schedule a shared resource that is used in parallel with other resources. 
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Figure 1: (Color online) An example of a fork-join processing network with arbitrary number of job 
types, arbitrary number of forks associated with each job type, and a single shared server. 


The simpler fork-join network shown in Figure 2 serves to illustrate why fork-join network control 
is difficult. In that network, there are two arriving job types (a and b ), seven servers (numbered 1 to 
7), and ten buffers (numbered 1 to 10). We assume h a is the cost per unit time to hold a type a job, 
and hb to hold a type b job. Type a ( b ) jobs are first processed at server 1 (2), then “ fork ” into two 
jobs, one that must be processed at server 3 (5) and the other at server 4, and finally “ join” together 
to complete their processing at server 6 (7). There is synchronization because the processing at server 
6 (7) cannot begin until there is at least one job in both buffers 7 and 8 (9 and 10). Server 4 processes 
both job types, but can only serve one job at a time. The control decision is to decide which job 
type server 4 should prioritize. This decision could be essentially ignored by serving the jobs in the 
order of their arrival regardless of type (that is, implementing FCFS policy). Another option is to 
always prioritizing the more expensive job type, in accordance with the well-known cfi- rule. Then, if 
ha^A > hb^B where ha {hb) is the rate at which server 4 processes type a ( b ) jobs, server 4 always 
prefers to work on a type a job over a type b job. However, when there are multiple jobs waiting 
at buffers 8 and 10, and no jobs waiting at buffer 9, it may be preferable to have server 4 work on 
a type b job instead of a type a job (and especially if also no jobs are waiting at buffer 7). This is 
because server 4 can prevent the “join” server 7 from idling without being the cause of server 6’s 
forced idling. (Server 3 is the cause.) 

The fork-join network control problem is too difficult to solve exactly, and so we search for an 
asymptotic solution. We do this under the assumption that server 4 is in heavy traffic. Otherwise, 
the scheduling control in server 4 has negligible impact on the delay of type a and type b jobs. We 
further assume the servers 6 and 7 are in light traffic, which focuses attention on when the required 
simultaneous processing of jobs at those servers forces idling. The servers 1, 2, 3, and 5 can all be in 
either light or heavy traffic. 

In the aforementioned heavy traffic regime, we formulate and solve an approximating diffusion 
control problem (DCP). The DCP solution matches the number of jobs in buffer 4 to that in buffer 
3, except when the total number of jobs waiting for processing by server 4 is too small for that to be 
possible. The implication is that when server 3 is in light traffic, so that buffer 3 is empty, buffer 4 


2 









Figure 2: (Color online) A fork-join processing network with two job types and a single shared server. 


is empty and all jobs waiting to be processed by server 4 are type b jobs. Otherwise, when server 3 
is in heavy traffic, the control at server 4 must carefully pace its processing of type a jobs to prevent 
“getting ahead” of server 3. 

Our proposed policy is motivated by the observation from the DCP solution that there is no 
reason to have fewer type a jobs in buffer 4 than in buffer 3. If server 3 can process jobs at least as 
quickly as server 4 can process type a jobs, then the control under which server 4 gives static priority 
to type a jobs performs well. Otherwise, we introduce a slow departure pacing (SDP) control in 
which server 4 slows its processing of type a jobs to match the departure process of type a jobs from 
buffer 4 to the one from buffer 3. 

SDP is a robust idea that is applicable to the more general network topology shown in Figure 1. 
To see this, we formulate and solve approximating DCPs for the fork-join network in Figure 2 but 
with task dependent holding costs (cf. Section 10.1), a fork-join network with an arbitrary number 
of “forks” (cf. Figure 7 in Section 10.2), and fork-join networks with more than two job types (cf. 
Figure 8 in Section 10.3.1, Figure 1 and Remark 15 in Section 10.3.1, and Figure 9 in Section 10.3.2). 
In each case, the DCP solutions suggest that, depending on the processing capacities of the servers 
and the network state, the servers in the network that process more than one job type should either 
give static priority to the more expensive job types or slow the departure process of these more 
expensive jobs in order to sometimes prioritize the less expensive jobs. This prevents unnecessary 
forced idling of the downstream “join” servers that process the less expensive job types, without 
sacrificing the speed at which the more expensive job types depart the network. 

We prove that our proposed policy is asymptotically optimal in heavy traffic for the fork-join 
network in Figure 2. To do this, we first show that the DCP solution provides a stochastic lower 
bound on the holding cost under any policy at every time instant. This is a strong objective, in line 
with that in Ata and Kumar [2005] , which implies asymptotic optimality for minimizing the expected 
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total discounted cost or average cost over a finite time horizon (but does not guarantee asymptotic 
optimality for minimizing average cost over an infinite time horizon without a limit interchange 
result). Then, we prove a weak convergence result that implies the aforementioned lower bound is 
achieved under our proposed policy. That rigorous analysis suggests that our translations of the 
DCP solutions relevant to the more general topologies shown in Figures 1 and 9 should also perform 
well. 

The weak convergence result when the network operates under the SDP control is a major 
technical challenge for the paper. This is because the SDP control is a dynamic control that depends 
on the network state. In order to obtain the weak convergence, we must carefully construct random 
intervals on which we know the job type server 4 is prioritizing. Although this idea is similar in 
spirit to the random interval construction in Bell and Williams [2001], Ghamami and Ward [2013], 
the proof to show convergence on the intervals is much different, due to the desired matching of the 
type a job departure processes from the servers 3 and 4. More specifically, the interval construction 
is determined by tracking and comparing the job counts in buffers 3 and 4, because the SDP policy 
prioritizes type a jobs when the number of jobs in buffer 4 exceeds that in buffer 3, and prioritizes 
type b jobs otherwise. Then, the keys to obtaining the desired weak convergence result are as follows. 

• On the intervals on which type a jobs are prioritized, we must bound the difference between 
two renewal processes having different rates. 

• On the intervals on which type b jobs are prioritized, we require a rate of convergence result 
on a light traffic GI / GI /1 queue that is different than any result we find in the literature (for 
example, the one in Dai and Weiss [2002]). 

Finally, when we piece those intervals together, we see the DCP solution arise. 

The remainder of this paper is organized as follows. We conclude this section with a literature 
review and a summary of our mathematical notation. Section 2 specifies our model and Section 
3 provides our asymptotic framework. We construct and solve an approximating DCP in Section 
4. We introduce the SDP control in Section 5, and specify when the proposed policy is SDP and 
when it is static priority. Section 6 proves that the DCP solution provides a lower bound on the 
performance of any control, and Sections 7 and 8 prove weak convergence under the proposed policy. 
Section 9 provides extensive simulation results. In Section 10, we construct and solve approximating 
DCPs for a broader class of fork-join networks. Section 11 makes concluding remarks and proposes a 
future research direction. We separate out our rate of convergence result for a light traffic GI/GI /1 
queue in the appendix, as that is a result of interest in its own right. We also provide the proofs of 
the results that use more standard methodology as well as more detailed simulation results in the 
appendix. 

1.1 Literature Review 

The inspiration for this work came from the papers Nguyen [1993, 1994], Nguyen [1993] es¬ 
tablishes that a feedforward FCFS fork-join network with one job type and single-server stations 
in heavy traffic can be approximated by a reflected Brownian motion (RBM), and Nguyen [1994] 
extends this result to include multiple job types. The dimension of the RBM equals the number 
of stations, and its state space is a polyhedral region. In contrast to the RBM approximation for 
feedforward queueing networks (Harrison [1996], Harrison and Van Mieghem [1997], Harrison [1998, 
2006]), the effect of the synchronization constraints in fork-join networks is to increase the number 
of faces defining the state space. Also in contrast to feedforward queueing networks, that number 
is increased further when moving from the single job type to multiple job type scenario. Although 
delay estimates for fork-join networks follow from the results of Nguyen [1993, 1994], they leave open 
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the question of whether and how much delays can be shortened by scheduling jobs in a non-FCFS 
order. 

To solve the scheduling problem, we follow the “standard Brownian machinery” proposed in 
Harrison [1996]. This is typically done by first introducing a heavy-traffic asymptotic regime in 
which resources are almost fully utilized and the buffer content processes can be approximated by a 
function of a Brownian motion, and second formulating an approximating Brownian control problem. 
Often, the dimension of the approximating Brownian control problem can be reduced by showing 
its equivalence with a so-called workload formulation (Harrison and Van Mieghem [1997], Harrison 
[2006]). The intriguing difference when the underlying network is a fork-join network is that the join 
servers must be in light traffic to arrive at an equivalent workload formulation. The issue is that 
otherwise the approximating problem is non-linear. This light traffic assumption is asymptotically 
equivalent to the assumption that processing times are instantaneous. Our simulation results suggest 
that our proposed control that is asymptotically optimal when the join servers are in light traffic 
also performs very well when the join servers are in heavy traffic. 

The papers Pedarsani et al. [2014a,b] are some of the few studies we find that consider the control 
of fork-join processing networks. In both papers, there are multiple job classes, but in Pedarsani 
et al. [2014a] the servers can cooperate on the processing of jobs and in Pedarsani et al. [2014b] 
they cannot. Their focus is on finding robust polices in the discrete-time setting that do not depend 
on system parameters and are rate stable. They do not determine whether or not their proposed 
policies minimize delay, which is our focus. The paper Gurvich and Ward [2014] seeks to minimize 
delay, but in the context of a matching queue network that has only “joins” and no “forks”. 

An essential question to answer when thinking about controls for multiclass fork-join networks, 
as can be seen from the papers Lu and Pang [2016, 2015], Atar et al. [2012], is whether or not 
the jobs being joined are exchangeable; that is, whether or not a task forked from one job can 
be later joined with a task forked from a different job. Exchangeability is generally true in the 
manufacturing setting, because there is no difference between parts with the same specifications, 
and generally false in healthcare settings, because all paperwork and test results associated with one 
patient cannot be joined with another patient. The papers Lu and Pang [2016, 2015] develop heavy 
traffic approximations for a non-exchangeable fork-join network with one arrival stream that forks 
into arrival streams to multiple many-server queues, and then must be joined together to produce 
one departure stream. The heavy-traffic approximation for the non-exchangeable network is different 
than for the exchangeable network, and the non-exchangeability assumption increases the problem 
difficulty. Their focus, different than ours, is on the effect of correlation in the service times, and there 
is no control. The paper Atar et al. [2012] looks at a fork-join network in which there is no control 
decision if jobs are exchangeable, and shows that the performance of the exchangeable network lower 
bounds the performance of the non-exchangeable network. Then, they propose a control for the 
non-exchangeable network that achieves performance very close to the exchangeable network. In 
comparison to the aforementioned papers, the exchangeability assumption is irrelevant in our case. 
This is because we assume head-of-line processing for each job type, so that the exact same type a 
( b ) jobs forked from server 1 (2) are the ones joined at server 6 (7). 

1.2 Notation and Terminology 

The set of nonnegative integers is denoted by N and the set of strictly positive integers are denoted 
by N_|_. The k dimensional (k £ N + ) Euclidean space is denoted by M fc , M + denotes [0, +oo), and 0^ 
is the zero vector in M fc . For x, y £ M, x V y := max{x,y}, x A y := min{x,y}, and (x) + := x V 0. 
For any x £ M, [xj (|~x]) denotes the greatest (smallest) integer which is smaller (greater) than or 
equal to x. The superscript ' denotes the transpose of a matrix or vector. 

For each k £ N+, O fc denotes the the space of all uj : M+ —> which are right continuous 


5 


with left limits. Let 0 6 D be such that 0(t) = 0 for all t £ R+. For u £ B and T £ M + , we let 
||w||r := sup 0 <t<T |cu(t)|. We consider endowed with the usual Skorokhod J\ topology (cf. Chapter 
3 of Billingsley [1999]). Let B( D fc ) denote the Borel a-algebra on ID fc associated with Skorokhod J\ 
topology. By Theorem 11.5.2 of Whitt [2002], B( D fc ) coincides with the Kolmogorov er-algebra 
generated by the coordinate projections. For stochastic processes W r , r £ M + , and W whose sample 
paths are in lB> fc for some k £ N+, “W r =>■ VF” means that the probability measures induced by W r 
on (D fc , £>(B fc )) weakly converge to the one induced by W on (B fc , £>(ID fc )) as r —> oo. For x,y £ O, 
x V y, x A y, and (x) + are processes in O such that (x V y)(t) := x(t) V y(t), (x A y)(t) := x(t) A y(t), 
and (x) + (t) := (x(t)) + for all t £ M + . For x £ D, we define the mappings T, 4> : D —y B such that 
for all t > 0, 

T(x)(t) := sup (-x(s)) + , <h(x)(t) := x{t) + T(x)(t), 

o <s<t 

where $ is the one-sided, one-dimensional reflection map, which is introduced by Skorokhod [1961]. 

Let Z = {1, 2,... , n} and AQ be a process in ID for each i £ Z. Then (X t , i £ Z) denotes the 
process (X\,X 2 , ■ ■ ■ ,X n ) in D n . We denote e as the deterministic identity process in ID such that 
e(t) = t for all t > 0. We abbreviate the phrase “uniformly on compact intervals” by “u.o.c.”, 

“almost surely” by “a.s.”. We let -—>■ denote almost sure convergence and = denote “equal in 
distribution”. I denotes the indicator function and BM q {6, E) denotes a Brownian motion with drift 
vector 6 and covariance matrix E which starts at point q. The big-0 notation is denoted by O(-), 
i.e., if x : M+ —>• M and y : M+ M are two functions, then x(t) = 0(y(t)) as t —» oo if and only if 
there exist constants C and to such that \x(t)\ < C\y(t)\ for all t > to- Lastly, o p (-) is the little-o in 
probability notation, i.e., if {X n ,n £ N} and {Y n ,n £ N} are sequences of random variables, then 
X n = o p (Y n ) if and only if |X n |/|Y^| converges in probability to 0. 

2 Model Description 

We consider the control of the fork-join processing network depicted in Figure 2. In this network, 
there are 2 job types, 7 servers, 10 buffers, and 8 activities. The set of job types is denoted by J , 
where J = {a, 6} and a and b denote the type a and type b jobs, respectively. The set of servers is 
denoted by S , where S = {1, 2, ..., 7}. The set of buffers is denoted by 1C, where K, = { 1 , 2 ,..., 10}, 
and the set of activities is denoted by A where A = {1, 2, 3, A, B, 5, 6, 7}. Except for server 4, each 
server is associated with a single activity. Server 4 is associated with two activities, denoted by A 
and B, which are processing type a jobs from buffer 4 and type b jobs from buffer 5, respectively. 
Both server 6 and server 7 deplete jobs from 2 different buffers. Note that these two servers are 
join servers and process jobs whenever both of the corresponding buffers are nonempty. Hence, both 
server 6 and server 7 are associated with a single activity, namely activities 6 and 7, respectively. 
Let s : A —> S be a function such that s(j ) denotes the server in which activity j, j £ A takes 
place. Let / : AL\{1, 2} —> A be a function such that f(k) denotes the activity which feeds buffer k, 
k £ /C\{1, 2}. Lastly, let d : 1C —> A be a function such that d(k) denotes the activity which depletes 
buffer k, k £ 1C. For example, s(H) = s(B) = 4, /(4) = 1 and d( 4) = A. 

2.1 Stochastic Primitives 

We assume that all the random variables and stochastic processes are defined in the same complete 
probability space (H, T, P), E denotes the expectation under P, and P(A, B) := P(H fl B). 

We associate the external arrival time of each job and the process time of each job in the cor¬ 
responding activities with the sequence of random variables {vj(i),j £ J U and the strictly 

positive constants {A j,j £ J} and {/ij, j £ A}. We assume that for each j £ J U A, {vj(i)}^ 1 
is a strictly positive, independent and identically distributed (i.i.d.) sequence of random variables 
mutually independent of for all k £ {J U*4)\{j}, E[vj(l)] = 1, and the variance of Vj( 1), 
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denoted by Var('Dj(l)), is aj . For j £ J, let Vj(i) := Vj(i)/\j be the interarrival time between the 
(i — l)st and ith type j job. Then, A j and aj are the arrival rate and the coefficient of variation 
of the interarrival times of the type j jobs, where j £ J. For j £ {1,3, A, 6} (j £ {2,R,5,7}), let 
Vj(i) := be the service time of the ith incoming type a ( b ) job associated with the activity 

j. Then, fij and oj are the service rate and the coefficient of variation of the service times related 
to activity j. j £ A. For each j € J U A, let Vj(0) := 0 and 

n 

Vj{n) := Vn £ N+, Sj(t) := sup{n £ N : Vj(n) < t}. (1) 

i =1 

Then, Sj is a renewal process for each j £ J U A. If j £ J. Sj(t ) counts the number of external 
type j arrivals until time t\ if j £ A, Sj(t ) counts the number of service completions associated with 
activity j until time t given that the corresponding server works continuously on this activity during 
[0 ,t]. “ 

2.2 Scheduling Control and Network Dynamics 

Let Tj(t), j £ A, denote the cumulative amount of time server s(j) devotes to activity j during 
[0,t]. Then, a scheduling control is defined by the two dimensional service time allocation process 
(T a ,T b ). Although a scheduling control indirectly affects (Tq,T^), since we do not have any direct 
control on servers 6 and 7, we exclude (Tq,T^) from the definition of the scheduling control. Let, 

■= t - Tj(t), j £ A\{A,B}, (2a) 

h(t):=t-T A (t)-T B (t), (2b) 

denote the cumulative idle time of the servers during the interval [0,t], For any j £ A, Sj(Tj(t )) 
denotes the total number of service completions related to activity j in server s (j ) up to time t. For 
any k £ 1C, let Qk(t) be the number of jobs waiting in buffer k at time t, t > 0, including the jobs 
that are being served. Then, for all t > 0, 

Qi(t) := S a (t) - SrCZK*)) > 0, Q 2 (t) := S b (t) - S 2 (T 2 (t )) > 0, (3a) 

Qk(t) := Sf( k )(Tf( k )(t)) - Sd( k )(T d ( k )(t)) >0, k € /C\{1,2}. (3b) 

For simplicity, we assume that initially all buffers are empty, i.e., Q k (0) = 0 for all k £ 1C. Later, we 
relax this assumption in Remark 9. 

We have 


Vj(S j (T j (t))) < Tj(t) < V j (S j (T j (t)) + 1), for all j £ A and t > 0, (4) 

which implies that we consider only head-of-the-line (HL) policies, where jobs are processed in FCFS 
order within each buffer. In this network, a task associated with a specific job cannot join a task 
originating in another job under the HL policies; that is, recalling our literature review (cf. Section 
1.1), the notion of exchangeability is not present. 

It is straightforward to see that work conserving policies are more efficient than the others in this 
network. Hence, we ensure that all of the servers work in a work-conserving fashion by the following 
constraints: For all t > 0, 

Ij(-) is nondecreasing and I 3 (0) = 0 for all j £ S, 

Is{d(k))(t) increases if and only if Q k (t) = 0, for all k £ {1,2,3,6}, 

Ii(t) increases if and only if Qi(t) V Q${t) = 0, 
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(5a) 

(5b) 

(5c) 


Ie(t) increases if and only if Qj(t) A Qs(t) = 0, (5d) 

/7(f) increases if and only if Qg(t) A Qio(t) = 0. (5e) 

A scheduling policy T := (Ta,Tb) is admissible if it satisfies the following conditions: For any 
Tj, i € A, Ij, j G S , and Qk, k G 1C satisfying (2), (3), (4), and (5), 

Tj(t) G J 7 , Vi > 0 and j G {A,B}, (6a) 

Tj(-) is continuous and nondecreasing with Tj( 0) = 0, V) G {A,B}, (6b) 

I±{-) is continuous and nondecreasing with h(0) = 0. (6c) 


Conditions (6a)-(6c) imply that we consider a wide range of scheduling policies including the ones 
which can anticipate the future. 

2.3 The Objective 

A natural objective is to minimize the discounted expected total holding cost. Let h a and hi, 
denote the holding cost rate per job per unit time for a type a and b job, respectively; and 5 > 0 be 
the discount parameter. Moreover, let 


Z a (t) Q3W + Q4W + Q 7(0 + Qs(i)> Zb{t) Q§{t) + Qe(t) + Q$(t) + Qio(t). 


Then Z a (t ) and Zj,(t) denote the total number of type a and b jobs in the system except jobs waiting 
in buffers 1 and 2. Since Q\{t) and Qiit) are independent of the scheduling policy, we exclude these 
processes from the definitions of Z a {t) and Zb(t). Then, the expected total discounted holding cost 
under admissible policy T is 


Jt = E 



(■ h a Z a (t) + h b Z b (t )) d t , 


(7) 


and our objective is to find an admissible policy which minimizes (7). Another natural objective is 
to minimize the expected total cost up to time t, t G R + , which is 


J T = E 


(■ h a Z a (s) + h b Z b (s))ds 


( 8 ) 


Yet another possible objective is to minimize the long-run average cost per unit time, 

J T = limsup — E 

t —>00 t 

We focus on a more challenging objective which is minimizing 


(h, 


a^a^s) “t - h b Z b {s)) ds 


(9) 


P ( h a Z a (t ) + h b Z b (t) > x ), for all t G M + and x > 0 . (10) 

It is possible to see that any policy that minimizes (10) also minimizes the objectives (7), (8), and 
(9). 

In this specific network, for all t > 0 


Qs(t) + Qi(t) = Q±(t) + Q&(t), Qb{t) + <2g(£) = Q${t) + Qio(t). (11) 

By (11), a policy is optimal under the objective (10) if and only if it is optimal under the objective 
of minimizing 


p (h a ( Qs(t ) + Qi(t)) + h b (Qe(t) + Qio(t)) > x) , for all t G M+ and x > 0 . (12) 


We will focus on the objective (12) from this point forward. 








3 Asymptotic Framework 

It is very difficult to analyze the system described in Section 2 exactly. Even if we can accomplish 
this very challenging task, it is even less likely that the optimal control policy will be simple enough 
to be expressed by a few parameters. Therefore, we focus on finding an asymptotically optimal 
control policy under diffusion scaling and the assumption that server 4 is in heavy traffic. We first 
introduce a sequence of fork-join systems and present the main assumptions done for this study in 
Section 3.1. Then we formally define the fluid and diffusion scaled processes and present convergence 
results for the diffusion scaled workload facing server 4 and the diffusion scaled queue length processes 
associated with servers 1, 2, 3, and 5 in Section 3.2. The question left open is to determine what 
should be the relationship between the control and the number of type a and b jobs in the workload 
facing server 4. 

3.1 A Sequence of Fork-Join Systems 

We consider a sequence of fork-join systems indexed by r where r —> oo through a sequence 
of values in R + . Each queueing system has the same structure defined in Section 2 except that 
the arrival and service rates depend on r. To be more precise, in the rth system, we associate the 
external arrival time of each job and the process time of each job in the corresponding activities 
with the sequence of random variables {vj(i). j £ J U A}“ 1 , which we have defined in Section 2.1, 
and the strictly positive constants {A r j,j £ J} and {//£, j £ A} such that Vj(i) := Vj(i)/Xj, j £ J is 
the interarrival time between the (i — l)st and ith type j job and Vj(i) := (i )/, j £ {1,3, A, 6} 

(j £ {2,B,5, 7}) is the service time of the ith incoming type a ( b ) job associated with the activity 
j in the rth system. Therefore, A{', j £ J and / i r -, j £ A are the arrival rates and service rates in 
the rth system whereas the coefficient of variations are the same with the original system defined 
in Section 2. From this point forward, we will use the superscript r to show the dependence of the 
stochastic processes to the rth queueing system. 

Next, we present our assumptions related to the system parameters. The first one is related to 
cost parameters. 

Assumption 1 Without loss of generality, we assume that h a ^A > hbp-B- 

This assumption implies that it is more expensive to keep type a jobs than type b jobs in server 4. 
Second, we make the following assumptions related to the stochastic primitives of the network. 

Assumption 2 There exists a non-empty open neighborhood, O, around 0 such that for all a £ O, 

E[e Q ^(i)] < 00j f or a u j € J u A. 

Assumption 2 is the exponential moment assumption for the interarrival and service time processes. 
This assumption is common in the queueing literature, cf. Harrison [1998], Bell and Williams [2001], 
Maglaras [2003], Meyn [2003]. 

Our final assumption concerns the convergence of the arrival and service rates, and sets up heavy 
traffic asymptotic regime. 

Assumption 3 

1. \ r j —>• \j > 0 for all j £ J as r -A- oo. 

2. iTj —>• p,j > 0 for all j £ A as r —> oo. 

3- A a /ha + A b/hB = 1- 
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4 - r (A a/H a + A — 1) -» 9 a / ha as r —> oo, where 64 is a constant in R. 
5. As r —> 00, 


r(X r a — Hi) —>• 0 i G R U {—00}, r{X r b — H 2 ) —* #2 G R U {—00}, ( 13 a) 

r( X r a — H3) —>• #3 € R U {—00}, r(X r b — nl) —» #5 G R U {—00}. ( 13 b) 

6. X a < H6 and Xb < H7- 

7 . Ifh a HA = hbHB, there exists an ro G R + such that h a /j, r A > hbhB f or a ^ r — r o- 

Parts 3 and 4 of Assumption 3 are the heavy traffic assumptions for server 4. Note that, if server 4 
was in light traffic, its processing capacity would be high, thus any work-conserving control policy 
would perform well. Moreover, as explained in the emergency department example in Section 1, 
server 4 represents expensive and limited resources (e.g. a CT scanner) which needs to process 
multiple job types, thus it is expected from server 4 to be in heavy traffic in such a setting. By Parts 
1, 2, and 3, we have ha > A a and hb > A&. Part 5 states that each of the servers 1, 2, 3, and 5 
can be either in light or heavy traffic. On the one hand, if Qi = —00 for some i G {1,2,3,5}, then 
server i is in light traffic. On the other hand, if 6 i G R, then server i is in heavy traffic. Part 6 states 
that the two join servers are in light traffic. Note that, Atar et al. [2012], Gurvich and Ward [2014], 
Lu and Pang [2016, 2015] assume that the service processes in the join servers are instantaneous. 
Hence, Part 6 of Assumption 3 extends the assumptions on the join servers made in the literature. 

Lastly, Part 7 of Assumption 3 is done for technical reasons that occur only when h a HA = hbHB (cf. 

Section 6.1). 

For simplicity, we assume that 

Q r k ( 0) = 0, for all k G /C and r G R+. 

Later, we relax this assumption in Remark 9. Assumptions 1, 2, and 3 are assumed throughout the 
paper. 

3.2 Fluid and Diffusion Scaled Processes 

In this section, we present the fluid and diffusion scaled processes. For all t > 0, the fluid scaled 
processes are defined as 


S^(f) := r- 2 5J(r 2 t), j'GjUi, 


T]{t) := r ~ 2 Tj (r 2 t), j G A, 

(14a) 

Qj(t) := r- 2 Q r j(r 2 t), j G K, 


Tj(t) := r _2 /J(r 2 t), j G S, 

(14b) 

the diffusion scaled processes are defined as 




Sj(t) := r _1 (Sj (r 2 t) - A][r 2 t) , j G J, 


= r -1 (Sj (r 2 t) - H r f 2 t) , j G A, 

(15a) 

Tj(t) := r-'T^rh), j G A, 


= r _1 IJ(r 2 t), j G S, 

(15b) 

Qj(t) := r^Qji^t), j€fC. 



(15c) 


By the functional central limit theorem (FCLT), cf. Theorem 4.3.2 of Whitt [2002], we have the 
following weak convergence result. 

(s r j ,j€jUA)=>(s i ,j€jUA), (16) 
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where Sj is a one-dimensional Brownian Motion for each j G JV) A such that Sj = BMq(0, A j<j 2 ) for 

j G J , Sj = BMq( 0, pja 2 ) for j G A and each Sj is mutually independent of Si, i G (J U M)\{j}. 
For t > 0, let us define the workload process (up to a constant scale factor) 

w;(t) ■■= «(*) + 4«(*)- (in 

Ub 

Then, W 4 (t)/n A is the expected time to deplete buffers 4 and 5 at time t, if no more jobs arrive 
after time t in the rth system. Let W\{t ) := r~ 2 Wf(r 2 t) and W[(t) := r~ 1 Wf(r 2 t) denote the fluid 
and diffusion scaled workload processes, respectively. 

Next, we present the convergence of the fluid scaled processes under any work-conserving policy. 

Proposition 1 Under any work-conserving policy (cf. (5)), as r —> oo 

(QL k G K, W 4 , T r j,j £4)^) (Q fc , k€JC, W 4 , Tj, j G A) u.o.c., 

where Q k = 0 for all k G 1C, W 4 = 0, and Tj(t) = (A a /pj)t for j G {1,3, A, 6} and Tj(t ) = (A b/hj)t 
for j G {2, B, 5, 7} for all t > 0. 


The proof of Proposition 1 is presented in Appendix B.l. We will use Proposition 1 to prove 
convergence results for the diffusion scaled processes. 

Considering Assumption 3 Part 5, let 7i := {i € {1, 2, 3, 5} : 0i G M}, i.e., Ti is the set of servers 
which are in heavy traffic among the servers 1, 2, 3, and 5. Let \H\ be the cardinality of the set 7~i, 
and for each i G {1,2, 3, 5}, let 


Xi ■= 


1, if server i is in heavy traffic, i.e., i G hi, 

0, if server i is in light traffic, i.e., i 7i. 

For all t > 0, let 

S[(f) := Xi A (TiW) - r(K - A)t) + (1 - xi) (§») - «(« 

Sim ~ X2 (s; (r 2 (ij) - r(A i - „ r 2 )t) + (i - X 2 ) (sa«) - «(*)). 

Then, by using (18), we define the so-called U netput process” for each buffer as 
xm ■■= SI(t) - sr {k) (r d{k) (t)) +r(A[ for (k,l) G {(1, a), (2, 6)}, 

X r k {t) := Sift) -S r m [T r d{k) {t)) +r(A{ -p\ k) )t, for (k,l,i) G {(3,1, a), (6, 2, b)}, 

' Ar A,; 


(18a) 

(18b) 

(19a) 

(19b) 


X r k (t) ■■= SJ(t) - S r d(k) {T\ k) {t)) + rpf d{k) 


d(k) Ud(k) 


t, for {k, l, i) G {(4,1, a), (5, 2, b)}, 

(19c) 


XUt) ■= Si(t ) - Q'(t) - Q'(t) - (T d(k) (t)) + r(A[ - ^ w )t, 

for (M,U) e {(7,n,l,3), (8, a, 1,4), (9,6.2, 5), (10, b, 2,6)). (19d) 


Let 


Q r := 


'Q\ 


X[ 


T 

Q r 2 


X r 2 



Ql 

, X r : = 

XI 

ii 

Tf 

l 3 

Ql 


Xl r ^ 


U 

fb 

wi 


vr ^ a yr 

I/M + Mb 5 -l 


fr 

L J 4j 


( 20 ) 
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e : = 


( 21 ) 


0i 


Mi 

0 

0 

0 

0 ' 

02 


0 

V 

M 2 

0 

0 

0 

CO 

1 

>< 

, RT:= 

-X 1 M 1 

0 

ul 

0 

0 

#5 - X2#2 


0 

-X 2 M 2 

0 

r 

m 5 

0 

fli ~ X101 - X2J^0 2 _ 


-Xi hi 

M A V 

-X2^M 2 

0 

0 

T 

Ma 


and let R be a 5 x 5 matrix which is the component-wise limit of R r . Then, we have 


Q r = X r + R r I r , 


Let us define 


Ql=Xl + f d{k y 


s(d(fc))> 


k G {7,8,9,10}. 



1 

2 

3 

6 

4 

1 

( A a(0-a + 0-f) 

0 

—A a crf 

0 

-A a crj \ 

2 

0 

A + al) 

0 

-Aftfj| 

-^A 6<r? 

UB 0 1 

3 

-AaCJ 

0 

Cov(3, 3) 

0 

Cov(3,4) 

6 

0 

-Aftcr| 

0 

Cov(6,6) 

Cov(4, 6) 

4 

^ A a cr{ 

-^Aftd 1 
fL B 0 Z 

Cov(3,4) 

Cov(4,6) 

Cov(4,4) j 


where 


( 22 ) 

(23) 


(24) 


Cov(3, 3) := A a (xicrf + (1 - Xi)^l + , Cov(3,4) := A a (xia\ + (1 - %i)cr^) , 

Cov(4, 6) := —A 6 (x 2 <72 + (1 - X 2 )0ft) , Cov(6,6) := X b (xiaj + (1 - X 2 )<?l + of) , 

Cov(4,4) := A a (xiej + (1 — Xi ) a l + a A ) + (jj ~^ Aft (X202 + (1 — X' 2 Vft + &%) . (25) 


Then, we have the following weak convergence result. 


Proposition 2 Under any work-conserving policy (cf. (5)}, 

(< Q r i,Q r 2,Q r 3 ,Qlw x) =► (Qi,Q2,Qs,Q e ,w 4 ), (26) 

where Qi = 0 /or eac/i i R and ( Qi,i G R, W4) is a semimartingale reflected Brownian motion 
(SRBM) associated with the data (P^,Gy,Ty, Ry,0^). P\ B \ is the nonnegative orthant in 

is an |%|-dimensional vector derived from the vector 6 (cf. (21) ) by deleting the rows correspond¬ 
ing to each i, i £ R; 'Ey and Ry are \R\ x \R\-dimensional matrices derived from E (cf. (24) ) and 
R (cf. (21)} by deleting the rows and columns corresponding to each i, i fiR, respectively; and 0|^| 
is the origin in P\y\- 

The state space of the SRBM is P\y\l @y and Ey are the drift vector and the covariance matrix 
of the underlying Brownian motion of the SRBM, respectively; Ry is the reflection matrix; and 0|^| 
is the starting point of the SRBM. 


The formal definition of an SRBM can be found in Definition 3.1 of Williams [1998b], The proof 
of Proposition 2 is presented in Appendix B.2. 
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4 The Approximating Diffusion Control Problem 

In this section, we construct an approximating diffusion control problem (DCP) with non-rigorous 
mathematical arguments. However, the solution of the DCP will help us to guess a heuristic control 
policy. Then, we will prove the asymptotic optimality of this heuristic control policy rigorously in 
Sections 5, 6, 7, and 8. 

Parallel with the objective (12), consider the objective of minimizing 

P (ha W + + hb ^Qg(t) + Q[ 0 (t)j > 7 Vt > 0, x > 0, (27) 

for some r. Motivated by Assumption 3 Part 6, let us pretend that the service processes at servers 
6 and 7 happen instantaneously. Since the diffusion scaled queue length process weakly converges to 
0 in a light traffic queue, we believe that considering instantaneous service rates in the downstream 
servers will not change the behavior of the system in the limit. In this case, jobs can accumulate in 
buffer 7 (8) only at the times buffer 8 (7) is empty. Similarly, jobs can accumulate in buffer 10 (9) 
only at the times buffer 9 (10) is empty. By this fact and (11), 

Q r 7 = (Ql~Q r 3) + , Ql = (Q r 3-Ql) + , Q r 9 = (Q r 6 -Ql) + , Q r 10 = {Ql - Ql) + • (28) 

By (28), objective (27) is equivalent to minimizing 

P (h a (qw) + ( Ql(t ) - Q r 3 (t)) + ^J + h b ( Q r 6 (i ) + (g r 5 (t) - Q£(t)) + ) > x ), Vt > 0, x > 0. 

^ ^ ^ . (29) 

From the objective (29), we need approximations for Q 3, Q 5, and Q g and we will achieve this 
goal by letting r —> 00. 

By (15c), (17), and Proposition 2, we know that (Q 3 ,Qq,) weakly converges to ( Q 3 ,Qe ) and 
Qa + weakly converges to W 4 . At this point, let us assume that 

(qi,qi) => (Qa,Q 5 ). 

Then we construct the following DCP: For each x > 0 and t > 0, 

min P (h a ^Q 3 (t) + (Qa^) - Q 3 (t)^ ^ + h b Q 6 (t ) + (Q 5 (t) - Q 6 (*)) ) > x ) ’ 

s.t. QA(t) + = W 4 (t), (30) 

Hb 

Qk(t) > 0, for all k € {4, 5}. 

Intuitively, we want to minimize the total cost by splitting the total workload in server 4 to buffers 4 
and 5 in the DCP (30). Now, we will consider DCP (30) path-wise. Let u (ui £ D) denote a sample 
path of the processes in DCP (30) and for any F : D —> D, F(ojt) denote the value of the process F 
at time t in the sample path uj. Then, consider the following optimization problem for each uj € D 
and t > 0. 

min h a (Qa^) ~ Q 3 (w t )) + h b (Q 5 (ui t ) - , (31a) 

s.t. Q 4 (wi) + —Q 5 (u) t ) = W 4 (ujt), (31b) 

Hb 

Qk{oj t ) > 0, for all k € {4, 5}. (31c) 
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Note that, we exclude the term h a Q3{u)t)+h b Q§{ujt) from the objective function (31a) because Q 3 (cut) 
and Q§(u)t) are independent of the decision variables Q 4 {uit) and Q b {ut). Although the objective 
function (31a) is nonlinear, the optimization problem (31) is easy to solve because it has linear 
constraints. Moreover, Lemma 1, provides a closed-form solution for (31). 

Lemma 1 Consider the optimization problem 

min h a {q 4 - g 3 ) + + h b ( q 5 - q 6 ) + , 

HA 

s.t. q 4 H- q 5 = w 4 , 

HB 

<?4 > o, q 5 > 0, 

where q 4 and q§ are the decision variables, all of the parameters are nonnegative, and h a HA > h b HB- 
Then (/ 4 = g 3 A w 4 and q b = {hb / Ha){w 4 — qs) + is an optimal solution of this problem. 

Pf: Replacing q 4 with w 4 — (ha/hb)Q5 gives us the following equivalent optimization problem 
which has only one decision variable. 


mm 


s.t. 


h a (w 4 - 


HA 

—<?5 - 

Hb 

o ^ ^ Tb 

0 < q 5 < — w 4 . 

HA 
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+ h ( q 5 - q 6 y 


(32a) 

(32b) 


We will solve the optimization problem (32) case by case. First, consider the case in which w 4 < g 3 . 
If q b = 0, then objective function value becomes 0, which is the lowest possible objective function 
value of the problem (32). Therefore, q 4 = w 4 and q§ = 0 is an optimal solution when w 4 <q%. 

Second, consider the case where w 4 > q%. In this case, first consider the following subcase: 
{hb/Ha){w 4 — 13 ) < 16 - I n this case, if q§ = {hb/Ha){w 4 — g 3 ), then the objective function value 
becomes 0. Thus, q 4 = g 3 and q b = {hb / Ha){w 4 — <? 3 ) is an optimal solution when w 4 > <? 3 and 
{hb/Ha)(w 4 — 13 ) < 16- Lastly, consider the subcase in which {hb/Ha){u >4 — 13 ) > 16- Figure 3 
illustrates the change of the objective function value as q b increases from 0 to its upper bound, 
{hb/ha)w 4. 

Note that, when <75 = q^, the objective function value is h a (w 4 — {ha/hb)qg ~ 13 )- At this point, 
increasing q§ by a small amount, say e, will change the objective function value by {—h a HA/ HB + h b )e, 
which is a negative number because h a HA > h b HB- When <75 = {hb/Ha){w 4 — <? 3 ), increasing <75 will 
increase the objective function value with the rate h b . Therefore, as seen in Figure 3, q 4 = g 3 and 
15 = {hb/Ha){w 4 — < 73 ) is an optimal solution when w 4 > g 3 and {hb/Ha){w 4 — (? 3 ) > q§. ■ 

Therefore, by Lemma 1, we see that an optimal solution of the optimization problem (31) is 

Q 4 {oj t ) = W 4 (uj t ) A Q 3 (u;t) and Q b {u t ) = ( Hb/ HA) (w 4 {ui t ) - for all w and t > 0. This 

result and (28) imply the following proposition. 

Proposition 3 

(Qa, Q5, Qi, Qs 5 Q9 , Q10) 

= (q 3 A w 4 , ^ (w 4 - q 3 ) + , 0, (g 3 - w 4 y , 


Qe> — 


Hb 

Ha 


W 4 - 


q 3 ) 


Hb 

Ha 


W 4 - 


QsJ — Qd 


(33) 


is an optimal solution of the DCP (30). 
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Objective function value 



Figure 3: Objective function value of the problem (32) with respect to different q$ values. 


Note that the right hand side of (33) is independent of the scheduling policies by Proposition 2. 
Therefore, a control policy in which the corresponding processes weakly converge to the right hand 
side of (33) is a good candidate for an asymptotically optimal policy. 

The DCP solution in Proposition 3 matches the content level of buffer 4 to that of buffer 3, 
except when the buffer 3 content level exceeds the total work facing server 4 (that is, the combined 
contents of buffers 4 and 5). This ensures that server 4 never causes server 6 to idle because of the 
join operation, as is evidenced by the fact that buffer 7 is always empty whereas buffer 8 sometimes 
has a positive content level. At the same time, server 4 prevents any unnecessary idling of server 7 by 
devoting its remaining effort to processing the contents of buffer 5. Sometimes, that effort is sufficient 
to “keep up” with server 5 and prevent the contents of buffer 10 from building and sometimes it is 
not. That is why sometimes buffer 9 has a positive content in the DCP solution and sometimes buffer 
10 does (but never both simultaneously). In the next section, we formally introduce the proposed 
policy. 

5 Proposed Policy 

Our objective is to propose a policy under which the diffusion scaled queue-length processes track 
the DCP solution given in Proposition 3. This is because the DCP solution provides a lower bound 
on the asymptotic performance of any admissible policy, as we will prove in Section 6 (see Theorem 
2 ). 

The key observation from the DCP solution in Proposition 3 is that there is no reason for the 
departure process of the more expensive type a jobs from server 4 to exceed that of server 3. Instead 
of ever letting server 4 “get ahead”, it is preferable to have server 4 work on type b jobs, so as to 
prevent as much forced idling at server 7 due to the join operation as possible. The only time there 
should have been more cumulative type a job departures from server 4 than from server 3 is when 
the total number of jobs facing server 4 is less than that facing server 3. In that case, server 4 can 
outpace server 3 without forcing additional idling at server 7. 

The intuition in the preceding paragraph motivates the following departure pacing policy, in 
which server 4 gives priority to type a jobs when the number of type a jobs in buffer 4 exceeds that 
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in buffer 3 and gives priority to the type b jobs in buffer 5 otherwise. 

Definition 1 Slow Departure Pacing (SDP) Policy. The allocation process ( Ta,Tb) satisfies 


POO 

/ l(Q 3 (t)<Qfit))d(t-T A (t))=0, 

Jo 

(34a) 

POO 

/ i(Q 3 (t)> Qfit), Qfit) >0)d(t-T B (t)) = 0, 

Jo 

(34b) 

roo 

/ I (Q3(*) > Qfit), Qfit) + Qfit) > 0) d Ifit) = 0, 

Jo 

(34c) 


together with (2), (3), (4), and (5). It is possible to see that ( Ta,Tb) that satisfies (34) also satisfies 
(6), and so is admissible. If Qfit) < Qfit), (34a) ensures that server 4 gives priority to buffer 4 ■ If 
Qfit) > Qfit) and Qfit) > 0, (34b) ensures that server 4 gives priority to buffer 5. (34c) ensures a 
work-conserving control policy in server 4 when Qfit) > Qfit). 

When p% < pa, so that server 3 is the slower server, we use the slow departure pacing policy to 
determine when server 4 can allocate effort to processing type b jobs without increasing type a job 
delay. Otherwise, when P 3 > p a, there is almost never extra processing power to allocate to type b 
jobs, and so a static priority policy will perform similarly to the slow departure pacing policy (see 
Remark 11 regarding our numerical results). 

Definition 2 Static Priority Policy. The allocation process ( Ta,Tb) satisfies 


POO 

/ I(Qfit) >0)d(t-T A (t)) =0, 

Jo 

(35a) 

POO 

/ I (Qfit) + Qfit) > 0) dlfit) = 0, 

Jo 

(35b) 


together with (2), (3), (4), and (5). It is possible to see that ( Ta,Tb) that satisfies (35) also satisfies 
(6), and so is admissible. (35a) ensures that server 4 gives static priority to buffer 4 and (35b) 
ensures a work-conserving control policy in server 4- 

The proposed policy is the SDP policy in Definition 1 when p$ < pa and is the static priority 
policy in Definition 2 when p 3 > pa • We have the following weak convergence result associated with 
the proposed policy. 


Theorem 1 Under the proposed policy, 

(QfikeJC, Wl 


i^Qki 


k e K , W 4 


where ^Qi, Q 2 , Q 3 , Qe, is defined in Proposition 2 and (Q 4 ,Q 5 ,Q 7 ,Q 8 ,Q 9 ,Qio^) is defined i 
Proposition 3. 


The proof of Theorem 1 is presented in Section 7. Theorem 1, the continuous mapping theorem 
(see, for example Theorem 3.4.3 of Whitt [2002]), and Theorem 11.6.6 of Whitt [2002] establish the 
asymptotic behavior of the objective function (12) under the proposed policy. 


16 


Corollary 1 Under the proposed policy, for all t > 0 and x > 0, we have 
mnP (h a (Q 3 (t) + Qj(t)^ + hb (Qe(t) + Qio(t)) > x > 


= P (h a Q 3 (t) + h b ( Q 6 (t) + (w 4 {t) - Q 3 (t)j - Q 6 (t)) 


> x 


(36) 


Remark 1 The proposed policy is a preemptive policy. However, it is often preferred to use a non- 
preemptive policy. To specify a non-preemptive policy, we must specify which type of job server 4 
chooses to process each time server 4 becomes free, and there are both type a and type b jobs waiting 
in buffers 4 and 5. The non-preemptive version of the SDP policy has server 4 choose to serve a type 
a job when the number of jobs in buffer 4 exceeds that in buffer 3, and to serve a type b job otherwise. 
The non-preemptive version of the static priority policy has server 4 always choose a type a job. We 
expect the performance of the non-preemptive version of our proposed policy to be indistinguishable 
from our proposed policy in our asymptotic regime, and we verify that the former policy performs 
very well by our numeric results in Section 9. 

Remark 2 The non-preemptive version of the proposed policy (cf. Remark 1) is easy to implement 
in practice. The system controller does not need know any X j, j £ J or gj, j £ A but he needs to 
know whether h a gA > h b gs or not in order to determine the job type that needs priority. Without 
loss of generality, suppose that the system controller knows that h a pLA > h b gB- Then, he needs 
to know whether g 3 < gA or not in order to determine which policy to implement among the non- 
preemptive versions of the SDP and static priority policies. Note that the simulation experiments (cf. 
Section 9.2) show that the nonpreemptive SDP policy performs well even when g 3 > g a- Thus, the 
nonpreemptive SDP policy is a safe choice when it is not clear whether g 3 < g a or not. In order to 
implement this policy, the system controller only needs to keep track of the number of jobs in buffers 
3 and 4 at the service completion epochs in server 4■ The implementation of the nonpreemptive static 
priority policy is even more trivial. The system controller also needs to know whether servers 6 and 
1 are in light traffic or not (cf. Part 6 of Assumption 3). However, since the simulation experiments 
(cf. Section 9.2) show that the nonpreemptive version of the proposed policy performs well even when 
at least one of the downstream servers are in heavy traffic, the latter required information is not 
very crucial in the implementation of the this policy. Lastly, whether server 4 is in light or heavy 
traffic does not affect the performance of the non-preemptive version of the proposed policy, because 
any work-conserving policy performs good when server 4 is in light traffic and the proposed policy is 
asymptotically optimal when server 4 is in heavy traffic (cf. Section 6). 

Remark 3 In the classical open processing networks, if a server is in light traffic, then the corre¬ 
sponding diffusion scaled buffer length process converges to 0 (see Theorem 6.1 of Chen and Man- 
delbaum [1991]). However, we see in Theorem 1 that although servers 6 and 7 are in light traffic, 
Qg and Q io are non-zero processes (moreover Qs is a non-zero process when server 3 is in heavy 
traffic). Therefore, even though a join server has more than enough processing capacity, significant 
amount of jobs can accumulate in the corresponding buffers due to the synchronization requirements 
between the jobs in different buffers. This makes the control of fork-join networks more challenging 
than the one of classical open-processing networks. 

Remark 4 We will prove that the SDP and the static priority policies are asymptotically optimal 
when X a < g 3 < g-A and g 3 > X a , respectively (cf. Sections 7.1 and 7.2 and Remark 6). Hence, 
both SDP and static priority policies are asymptotically optimal when X a < g 3 < gA and this implies 
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that there are many other asymptotically optimal control policies in addition to the one that we 
propose. However, the simulation experiments show that the static priority policy performs poorly 
when X a ~ H 3 but the SDP policy performs very well even when H 3 > ha (cf. Section 9.2). Therefore, 
the performance of the proposed policy that we construct is robust with respect to the parameters X a , 
H 3 , and ha, which has more practical appeal. 

In the following section, we formally define asymptotic optimality and prove that the proposed 
policy is asymptotically optimal given Theorem 1 and Corollary 1. 

6 Asymptotic Optimality 

In this section, we prove that the DCP solution (cf. Proposition 3) is a lower bound for all 
admissible policies with respect to the objective function ( 12 ), i.e., we have the following result. 

Theorem 2 Let {T r ,r > 0} be an arbitrary sequence of admissible policies. Then for all t > 0 and 
x > 0, we have 

limmfP (h a ( Q r 3 (t ) + Q r 7 (t)j + h b ( Q r 6 {t ) + Q 10 W) > x ) 

~ Qeitf'j ^ > ' ( 37 ) 

Theorem 2 together with Corollary 1 state that the proposed policy is asymptotically optimal. 

Remark 5 In Section 2.2, we state that the objective (12) implies the objectives (7), ( 8 ), and (9). 
Although (36) and (37) imply asymptotic optimality of the sequences of control policies with respect 
to the objectives (7) and ( 8 ), they do not necessarily imply the same result related to the objective 
(9). That is because we need to change the order of the limits with respect to r and t to get the 
desired result. However, for that, we need additional results such as uniform convergence of the 
related processes. 

6.1 Proof of Theorem 2 

Let us consider the term in the left hand side of (37). By (11) and the fact that Q r k > 0 for all 
k £ 1C, we have 

Q r At) > mt ) - Ql(t)) + , Q[ 0 (t) > (Ql(t) - Ql(t)) + , Vt > 0. 

Therefore, it is enough to prove for all t > 0 and x > 0, 

inf p (h a Uflit) + ( Ql(t ) - QKt)') ^ +h b (^Q r 6 {t) + ( Q r 5 (t ) - Q6(*)) + ) > 

~ ■ (38) 

By (16), Proposition 1, and Theorem 11.4.5 of Whitt [2002] (joint convergence when one limit is 
deterministic), we have 

( S^jejuA , ( SjJejuA , T h i e A^. (39) 

Now, we use Skorokhod’s representation theorem (cf. Theorem 3.2.2 of Whitt [2002]) to obtain 
the equivalent distributional representations of the processes in (39) (for which we use the same 


> P 


haQz(t) + h b ( Q 6 (t) + (^ (w A (t) - Q 3 (t)) 




> P 


h a Q 3 (t) + h b\Qs(t) + (j^ ( W A (t ) - Q 3 (t)) 
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symbols and call “Skorokhod represented versions”) such that all Skorokhod represented versions of 
the processes are defined in the same probability space and the weak convergence in (39) is replaced 
by almost sure convergence. Then we have 


(,Sj,j ^ Sj,j 6 JllA, Ti,i € Aj , u.o.c. as r —>• oo. (40) 

We will consider the Skorokhod represented versions of these processes from this point forward and 
prove (38) with respect to these processes. 

By (3), (15), (17), (19), (22), (40), and Proposition 2, we have 

(Q r 3 ,WlQ r 6 ) ^ (Qs,W 4 ,Q 6 ), u.o.c., (41) 


where 


Wl = Ql + ^Q\, a.s., 


(42) 


and all of the processes in (41) and (42) have the same distribution with the original ones. Then by 
Fatou’s lemma, the term in the left hand side of (38) is greater than or equal to 


P ( liminf 

\ r—> oo 


ha (Qm + ( Ql(t ) - QS(i)) + ) + h b (qW) + (Ql(t) - Ql(t )) 


> X 


(43) 


For each t > 0 and sufficiently large r such that h a fi r A > h b [i r B (note that such an r exists by 
Assumption 1 and Parts 2 and 7 of Assumption 3) , we will find a path-wise lower bound for the term 


h a (dm + (dm ~ dm) + ) + h (dm + (dm 



(44) 


From this point forward, we will consider the sample paths in which Q k (t ), k € {3,4, 5, 6} are finite 
for all r and t. By (41) and (42), these sample paths occur with probability one. Let w be a 
sample path and w* be defined as in Section 4. By (42), (44), and the fact that Q^(t) and Qg(t) are 


independent of the control policy, we construct the following optimization problem. For each u> in fl 
except a null set and t > 0 

mm h a (dim - dmt)) + h (dmd - dmt)) , (45a) 

s.t. dmt) + 4 dm t ) = wik), (45b) 

l J B 

Q r M ) > 0, dmt) > o, (45c) 

where Q\{oJt) and are the decision variables. The optimization problem (45) has the same 

structure with the one presented in Lemma 1. Therefore, we can use Lemma 1 to solve (45) and in 
the optimal solution 

dim = dmd a dum = 4 (^tk) - Q3m) + • (46) 

k'A V 7 

Therefore, by (46), a path-wise lower bound on (44) under the admissible policy T r is 

KQm + h («(«) + (d (wl(t) - Ql(t)) + - 05(t)) + J . (47) 
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When we take the liminf^oo of the term in (47), by (41) and the continuous mapping theorem 
(specifically we use the continuity of the mapping (-) + and Theorem 4.1 of Whitt [1980], which 
shows the continuity of addition), (43) is greater than or equal to 

- ^ ^ • (48) 

Note that the lower bound in (48) is independent of control by Proposition 2 and (41). Therefore, 
(48) proves (37) for the Skorokhod represented versions of the processes. Since these processes have 
the same joint distribution with the original ones, (48) also proves Theorem 2. 

7 Weak Convergence Proof 

In this section, we prove Theorem 1. We consider the cases A a < H3 < ha and H 3 > HA separately. 
Note that the proposed policy is the SDP policy in (34) in the first case and the static priority policy 
in (35) in the second case. The proof of the second case is straightforward, but the proof of the first 
case is complicated because the SDP policy is a continuous-review and state-dependent policy. 

7.1 Case I: A a < /13 < UA (Slow Departure Pacing Policy) 

The following result plays a crucial role in the weak convergence of Q\ and Q \] under the proposed 
policy. 



Proposition 4 Under the proposed policy, for all e > 0 and T > 0, 

limP (\\Ql - Ql A Wf 11 T > e) = 0. 


(49) 


The proof of Proposition 4 is presented in Section 8. By (26), (49), and Theorem 11.4.7 of Whitt 
[2002] (convergence-together theorem), we have the following joint convergence result. 

(Qi, QI, QI, QI, QI, Wl) => (Qi, Q 2 , Q 3, Q 3 a w 4 , q 6 , w 4 ) . (50) 

By (17) and (50), we have 

QI => -(W 4 -Q 3 ) + . (51) 

HA 

At this point, we invoke the Skorokhod representation theorem again for all the processes in (39), 
(50), and (51), and we will use the same symbols again. Then, we can replace the weak convergence 
in (39), (50), and (51) with almost sure convergence in u.o.c. for the Skorokhod represented versions 
of the processes. Next, we will consider server 6. Let Zf := Qj A Qg. Then, by (19d) and (23), 


zi(t) = m)+Him, 

m) ■■= s r a (t ) - Ql(t) - Ql(t) V Ql(t) - )) + r(W a - pQt, 


(52) 


where t/g is defined in (52). Since Tg(t) increases only if Z^t) = 0, we have a Skorokhod problem 
with respect to Uf. By the uniqueness of the solution of the Skorokhod problem (cf. Proposition 
B.l of Bell and Willia m s [2001]) /Tg/g = ^(t/g) and Zg = <l>(£/g) for each r. By (40) and the fact 
that {Q^QIQI) {Qi,Q3,Q 3 AW 4 ) 

(h r Jl + r(X r a - He)e, Zf) (~S a + Q1 + Q3 + 5 6 (T 6 ), 0 ) , u.o.c., (53) 
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by Lemma 6.4 (ii) of Chen and Yao [2001]. By (19d), (23), and (53), 

& = S r a -Q[-Ql- Sl(r 6 {t)) + r(X r a - f^e + hVI ^ ~ 

Sa-Qi-Qs- S 6 (T 6 ) - S a + Qi + Q 3 + 5 6 (T 6 ) = 0 u.o.c., 

Ql = S r a -Q\-Ql- Sl(%(t)) + r(X r a - ^)e + ^ 

S a -Qi-Q 3 XW 4 -Sq(Tq) - S a + Qi + Qs + S 6 (T 6 ) = (ch - W A ) + u.o.c. 

By the same way, we can derive the following result for server 7. 

(Q r 9 , Qio) ^ - J- A (w 4 - Q 3 ) + ) + > (^4 - Qs) + - Qe) + ^) , u.o.c. 

Since the Skorokhod represented versions of the processes have the same joint distribution with 
the original ones, when the Skorokhod represented versions of the processes converge almost surely 
u.o.c., then the original processes weakly converge and we get the desired result. 

7.2 Case II: fis > f .la (Static Priority Policy) 

When fi 3 > ha, server 3 is in light traffic because Assumption 3 Parts 1, 2, and 3 imply ha > 
A a . Then Q 3 Q 3 = 0 by Proposition 2. Since server 4 gives static priority to buffer 4 over 
buffer 5 when /U 3 _> HA hi the proposed policy, then buffer 4 acts like a light traffic queue and 
Q\ ==> 0 = Q 3 A W 4 . This implies that all of the workload in server 4 accumulates in buffer 5 and 
Q 5 =>■ (hb / Ha)W 4 by (17) and Proposition 2. The convergence proof of all other processes is very 
similar to the one presented in Section 7.1. 

Remark 6 It is straightforward to see that the proof presented above holds when server 3 is in light 
traffic (X a < h 3 )- Hence, Theorem 1 and the Corollary 1 holds under the static priority policy 
whenever server 3 is in light traffic. Therefore, as stated in Remark f, the static priority policy is 
asymptotically optimal whenever server 3 is in light traffic. 

Remark 7 ITTien h 3 > TA, we do not need Proposition f to get the desired weak convergence result. 
However, the proof of Proposition 4 is straightforward in this case. By Theorem 1 and the continuous 
mapping theorem, Q 4 — Q 3 A W\ => 0, and this clearly gives us the desired result. 

From this point forward, we will only consider the case A a < h 3 < TA aud we prove Proposition 
4 under this case in the following section. The key is the construction of random intervals such that 
in any given interval, we know if server 4 is giving priority to buffer 4 or to buffer 5. We first define 
the intervals and then prove Proposition 4 in the following section 

8 Proof of Proposition 4 

The SDP policy is a dynamic policy that changes the relative priorities of type a and b jobs 
depending on the system state. The analysis of such a policy requires different arguments to show 
Q\ is close enough to Q 3 A Wf to satisfy (49) , depending on which class has priority. This motivates 
us to partition the interval [0, r 2 T] according to the aforementioned priority rules, so that we can 
break the proof of (49) into two different parts. 

We begin with the observation that type a jobs are given priority at all times t £ [0, r 2 T] for 
which Q\{t) > Q 3 (t), and type b jobs are given priority otherwise. Then, we define “up” intervals 
during which Q r 4 (t) > Q 3 (t) and “down” intervals during which Q\(t) < Q 3 {t) as follows. In the rth 
system, let t, f : P —> M + U {+ 00 } be such that Tq := 0 and 

tL —1 : = inf{i > tL - 2 : Q r 3 (t) = Ql(t ) - 1}, Vn £ N+, (54a) 
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r 2n : = ™f{t > TL-1 ■ Ql(t) = Vn <E N+. 


(54b) 


For completeness, if t” q = +oo for some no G N+, we define r” := +oo for all n > no- Finally, we 
bound (49) using the “up” and “down” intervals so that 

P(||Q 4 -Q 3 AWl|| r >e)=P( sup \Ql(t)-Q r 3 (t)AWl(t)\>re) 


\0<t<r 2 T 

<p( sup \Ql(t) - Q r 3 (t) A W£(t)\ > re 

Ue[0,r 2 TlnU^dr^.^J ; 


(55) 


+ P sup 

\te[o,r*T]nU~ =0 [TS n ,Tr n+1 ) 


\Ql(t)-Q r 3 (t)AWZ(t)\>re\, 


(56) 


where e > 0 and T > 0 are arbitrary. For the proof, it is sufficient to prove that both of the 
probabilities in (55) and (56) converge to 0 as r —> oo. 

The reason why the probability in (55) converges to 0 as r —»• oo relies on the up interval 
construction. During an up interval, Q\ > Q 3 , so Q 3 A W\ = Q 3 by (17), and server 4 gives priority 
to type a jobs. Since server 4 is faster at processing type a jobs than server 3, Q\ never becomes 
much larger than Q 3 . We make this argument rigorous in Section 8.1 below. 


<? 3 r (t) = <37(0, <? 5 r (t) > 0 
vt e [r 2 r ni r^*) 

K 


Q 4 (0 ^ Q 3 (t).Vt £ [t 2 ^ ,T 2n+1 ) Q 3 (0 ^ Q 4 (Oi Vt *= 1 T 2 n+l* T 271+2) 
A i 


Down 1 interval 


Down 2 interval 


Up interval 


<?3 r b 2 r n) = <34 r (r 2 n) 


<? s r (r 2 r n ’) = 0 


Q 3 (r 2 7+i) = Q^G n+1 )-l 


<?3 r (T 2 ’ n+2 ) = <2 4 r (T 2 r n+2 ) 


Priority to Buffer 5 Priority to Buffer 4 

Figure 4: Illustration of the down 1, down 2, and up intervals when A a < < yt a■ 


The argument to see the probability in (56) converges to 0 as r —f 00 requires the splitting of 
the down intervals, as shown in Figure 4. To see this, first observe that at the beginning of a down 
interval, If Q^t^) > 0, then server 4 works only on type b jobs until the first 

time buffer 5 empties, defined as 


r,* 

r 2n 


inf {t G [r 2 r n ,T 2 r n+1 ) : Q£(f) = 0} , if Q£(i) = 0 for some t G [r 2 r n , r 2 r n+1 ) 
< where r 2n < 00 , 

+ 00 , otherwise. 


(57) 


Note that it is possible that the next up interval starts before buffer 5 empties, in which case := 
00 . For t G K,r£), QS(t) = QS(t). Since Q§(t)AWI(i) = Q£(t) by (17), QS(*)-QS(*) AWI(t) = 0 
trivially. During the interval [^n, T 2 n+i) (when it exists), Q 5 behaves like a light traffic GI/GI/1 
queue starting from the origin. Hence, Q\ stays close to 0, so that Q\ ~ WJ. Since also Q\ < Q 3 by 
the down interval construction, Q 3 A W\ & Q If Qs(T£ n ) = 0, then r 2n = r 2 ^*, so that the second 
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half of the above argument works for this case as well. We make this argument rigorous in Section 
8.2 below. The key is the following convergence result, that is of interest in its own right. 

The light traffic GI/GI/l Queue Convergence Rate Result Let £ N + } and {v^,k £ 

N+j" 5 be two independent sequences of strictly positive and i.i.d. random variables such that E[ui] = 
1/A and E[ui] = 1/fJ. Consider a GI/GI/1 queue with infinite buffer capacity and FCFS service 
discipline such that the interarrival time between the (k — l)th and kth job arriving in the system 
after time 0 is Uk and service time of the kth job is Vk for all k £ N+. We assume that there exists 
an open interval centered at zero denoted by (—a, a) where a > 0 such that E[e aui ] < oo and 
E[e ai)1 ] < oo for all a £ (—a, a). Therefore, we assume exponential moment assumption for u\ and 
v\, i.e., all moments of u\ and v\ are finite. Let Q(t) denote the total number of jobs in the system 
at time t > 0, and Q(0) be a random variable which takes values in N. Then, we have the following 
large deviations result. 

Proposition 5 Suppose that A < fi, i.e., the queue is in light traffic. Fix arbitrary n £ N+ and 
e > 0 , and suppose that there exists an ro > 1 such that if r > uq, 

P ^Q(O) > Ae M < Cor 2n-i e -C ir ^ (58) 

for some constants Co > 0 and C\ > 0 which are independent of r. Then, there exists an r\ > 1 
such that if r >r\, 

P ( sup Q(t) > re J < C 2 r 2 n - 1 e" C3? ', (59) 

\0 <t<r n J 

such that C 2 > 0 and C 3 > 0 are constants which are independent of r. 

The proof of Proposition 5 is presented in Appendix A. 

8.1 Proof of Convergence of (55) (Up Intervals) 

Throughout Section 8.1, let 7 £ (0,1) be an arbitrary constant, a = 0(7) > 0 be a constant such 
that 4/(1 + a) < 7. Let N r := \{p ,3 + ei)r 2 T] where e\ > 0 is an arbitrary constant. Since there is 
a service completion in server 3 at T 2 n _i for each n £ N+, then 

P (t 2 V_ 1 < r 2 T ) < P ( S r 3 {r 2 T ) > W) < P ( S r 3 {r 2 T) > + e x )r 2 T) ^0 as r 00 , (60) 

where (60) is by functional strong law of large numbers (FSLLN) for renewal processes (cf. Theorem 
5.10 of Chen and Yao [2001]). 

Since, by construction, Q\{t) > Q 3 (t) for all t £ [pL-i, r 2 n) an d n € N+, then Q 3 (t) A Wf (t) = 
Q r 3 (t) for all t £ [ffin-i- ffin) and n £ N + by (17), and the probability in (55) is equal to 


P [ sup Ql(t) - Q r 3 (t ) > re j < 

\t£[0,r2T]n{J~ =1 [ J 

P (r 2 ATr_i < r 2 T) + P ( sup Q\{t) - Q r 3 {t) > re, t% n r_ x > r 2 T J . (61) 

VtG[0,r 2 T]nU^ =1 [r 2 7_i,r 2 7) j 


3 We present a general result in this section, thus we use a general notation in this section and in Appendix A, where 
we present the proof of the result of this section. 
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Note that the first probability in the right hand side of inequality (61) converges to 0 as r -> oo by 
(60). Hence, it is enough to consider the second probability in the right hand side of (61) which is 
less than or equal to 


N r 


E P SU P QKt) ~ QS(*) > r 2n-i < r 2 T 

^ \ r 2n-l— i<r 2n J 

E P ( SU P S K T S(t)) - S r A (T r A (t)) > re, 4_r < r 2 T ) 

E p f sup ^(TJ(t 2 V 1 + t)) - ^(TiK n _r + t)) > re, r 2 r n _i < r 2 T) 

E P ( sup ^(TJ(t 2 Vi) + t) - ^(TlK„_i) + t) > re, r 2 Vi < r 2 T) 

^ \ 0 < r 2n — r 2n -1 / 


(62) 


(63) 


where (62) is by (3b). We obtain (63) in the following way. Server 4 works on buffer 4 during the 
whole up interval by construction. However, server 3 can be idle during an up interval. Hence, we 
have for all t € [ 0 , r 2n — T 2n-1 ) 


T r A {r r 2n _i + t) = Tl(rLi) + t, T^ n _ 1 + t)< T^ n _ i) + t, 


(64) 


which gives (63). 

We expect the sum in (63) to converge to zero because each term is the difference between two 
renewal processes with different rates, and the faster renewal process is the one being subtracted. 
To formalize this intuition, it is helpful to bound those differences by using the processes 


E% n (t) := sup ■ 


k 


keN:}v r 3 (l + B r n )<t 


i=i 


KCtt) ■= su p 


|k € N : E v a(1 + ^n) ^ t \ I 


where Ya=i x i ■= 0 for an y sequence {x h l € N+}, A r n := -S , ^(T^(rJ n _ 1 )), and B r n := S$(T£ (t^J) . 
Then by (3b) and (54a), A r n = B^ — 1 . We have the following result. 


Lemma 2 For all t > 0, ^(T[(r^_,) +1) - ^(^(r^_,) +t) < i£ n (t) - E r f(t ) + 1. 


The proof of Lemma 2 is presented in Appendix B.3. By Lemma 2, (63) is less than or equal to 
N r / \ 


E P sup E 3 ’ (t) - (i) > re - 1 , r 2n _! < r T . 

JV r / 

^E p 


(65) 


sup *£"(*) - E r /(t) > re - 1, r^r < r 2 T 

= 1 V 0 - i< ( T 2n- T 2r l -l) V ( ?-7T ) J 


N r 


= E p sup E 3 ’ n (t) - E r J^{t) > re - 1, r 2 r n - r 2 r n _i > r 7 T, < r 2 T 

V°- t<r 2»- r 2n-l ) 

+ E P ( SU P ^3’"^) “ > re - !. r 2n - tL-i < r^T, T 2 r n _ 1 < r 2 T ] 

\ 0 - t<r7T / 

N r 

< E P K - TL-i > r 7 T, < r 2 T ) (66) 


n=l 
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N r / 

+ ^ p ( sup E r 3 ' n (t) 

n=\ \0<t<r^T 


EY(t) > re 


1 , Ci < 


r 2 T 


(67) 


We will show that both of the terms in ( 66 ) and (67) converge to 0 as r -> oo. 

8.1.1 Proof of Convergence of (66) (Length of Up Intervals) 

In this section, we show that the sum in ( 66 ) converges to 0 as r —> oo. This implies that the length 
of the up intervals within the time interval [0, r 2 T] is o p (r s ) for any 6 > 0. Let denote the event 
inside the probability in ( 66 ), i.e., 

C ■= { r 2n - T 2n -1 > ^T, r 2n-l < } • ( 68 ) 


By (3b) and the fact that Q A (t) > Q 3 (t) for all t £ Cn-i > T 2 n)- ( 66 ) is equal to 


E p ( ™p mb) - sjpj w> < o, d 

n=l VL-l<t<^ n -l+ rlT J 

N r / 

= E P SU P s A(T r A (rL-i + *))- s r 3 (n (C_1 + t)) < 0, er n 

Cl \0<t<r-yT 
N r ( 

< E P SU P C(C(C-i) + t) - CCC (C-i) + i) < 0, C 

n=l \0<t<r^T 

< E P ( SUP E A n ^) - E Ttt) < r 2n-l < r 2 rV 

n=r V°^ <r7T / 


(69) 

(70) 


where (69) is by (64), and (70) is by Lemma 2. 

Similar to (63), we consider the difference of two renewal processes in (70). We want to show 
that the probability that the number of renewals associated with the renewal process with higher 
renewal rate is always less than the number of renewals of the one with smaller renewal rate within 
a time interval of length r 7 T converges to zero as r —> oo. Let 


Ci ; = { SU P \ E 3 n (t) ~ /Cl < £ 2 r 7 T, sup | E r ^(t) - fi r A t | < e 2 r 7 T \ , (71) 

I 0<t<rlT 0 <t<r~<T \ 

where e 2 is an arbitrary constant such that 0 < e 2 < /13 A ((ea — /u. 3 )/2). Then, the sum in (70) is 
equal to 


N r 


E P sup E% n (t) - E% n (t) < 1, T r 2n _ x < r 2 T, £, 

1 0<t<r^T 


r 

tt .,1 


n=l 


+ 


7V r / 

£ p 


77.—1 


sup *£"(*) - ££"(t) < 1, C_r < r 2 T, (^ , 

y0<t<ri'T / 


(72) 

(73) 


where superscript c denote the complement of a set. Note that, on the set £^ x , we have for all 
t € [ 0 , r 7 T), 


/C - (*) £ /C + e 2 r 7 T, fi r A t - e 2 r 7 T < (t) < + e 2 r 7 T. (74) 


25 


This implies that the sum in (72) is less than or equal to 


V' P ( sup p A t - e 2 r 7 T — (p 3 t + e 2 r 7 T) < l^j 
n=l \0<t<r^T J 

= N r P ((p r A - ^)r 7 T - 2e 2 r 7 T < 1) (75) 

= ( ^ A 2 < e 2 ) 0 as r —>■ oo, (76) 

where (75) is by the fact that p A > p 3 for all r > r± for some r± £ M+; and (76) is by the fact that 
e 2 < (pA — Pa)/2. Now, let us look at the sum in (73), which is less than or equal to 


N r 


E p ( T 2n-i<r 2 T, (£ r nji y) 


n= 1 


< 


N r / 

EW 


\o <t<r^T 


sup |^ 3 r ’ n (t) - p r 3 t\ > e 2 r 7 T, r 2 r n _i < r 2 T 


N r 


+ P sup I“ VaA > £ 2r 7 T, r 2n _! < r 2 T . 


71=1 


0<t<r / >'T 


(77) 

(78) 

(79) 


It is straightforward to see that the sums in (78) and (79) converges to 0 as r —> oo by the following 
result, whose proof is presented in Appendix B.4. 


Lemma 3 For all 7 > 0, e 2 > 0 such that e 2 < p 3 A ((pa — P3)/2), 


N r 


E p 

71=1 


sup 

Ej ,n (t) — p T jt 

\0<t<r'yT 



and j £ {3, A}, 

-A 0, as r -A 00. 


(80) 


Note that Lemma 3 extends Lemma 9 of Ata and Kumar [2005] to a renewal process that starts from 
a random time. Consequently, the sum in (66) converges to 0 as r — > 00. 


Remark 8 The sum in (66) converges to 0 for all 7 > 0 as r -A 00. We need 7 < 1 in the next 
section (see (84) J. 


8.1.2 Proof of Convergence of (67) 

In this section, we will show that the sum in (67) converges to 0 as r -A 00. The sum in (67) is less 
than or equal to 

N r N r 

P (tL-1 < r 2 T, (^pD+^T P 

71=1 71=1 

where £f x is defined in (71). Note that the first sum in (81) converges to zero with the same way 
(77) does. The second sum in (81) is less than or equal to 



sup E% n (t)-E% n (t)>re-1, 


\ 0<Kr 7 T 


T 2n-1 E 


r 2 T, 


£ 


71,1 


(81) 


5^ P ( sup E 3 (*) _ E a (*) >re-l, £ n 

\0<t<r^T J 

N r / 

< P ( sup p 3 t + e 2 r 7 T — ( p A t — e 2 r 7 T) > re — 1 

rTEl \0<t<rrT 


(82) 
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as r —> oo 


= N r "P (2e 2 r 7 T > re - 1) 

= N r I(2e 2 r' y T > re - 1) -7 0 


(83) 

(84) 


where (82) is by (74); (83) is by the fact that when r is sufficiently large, we have \i r A > [ 1 %; and (84) 
is by the fact that 7 < 1 . 

Therefore, we prove that (55) converges to 0 as r —> 00 . In the next section, we will prove that 
(56) converges to 0 as r —> 00 . 

8.2 Proof of Convergence of (56) (Down Intervals) 

In this section, we consider the down intervals and prove the convergence of (56). Let N 2 := 
\(n r A + 63 )r 2 T~| where 63 > 0 is an arbitrary constant. Since there is a service completion of a type 
a job in server 4 at r 2n for all n G N+, then as r —> 00 

P (r- 2 N S < r 2 T) < P ( S r A (r 2 T) > N r 2 ) < P ( S r A (r 2 T ) > + e 3 )r 2 T) ^ 0 as r 00 , (85) 

where (85) is by FSLLN. Let 

T r n (t):=\Ql(t)-Q r 3 (t)AWm- (86) 

Then, the probability in (56) is less than or equal to 

P U N r < r 2 T) + P ( sup T r n (t) > re, r r 2N r > r 2 T ) . (87) 

V 7 \ 4e [°’ ? ' 2T ] n U^o[^.^ +1 ) J 


Note that the first probability in (87) converges to 0 as r ^ 00 by (85) and the second probability 
in (87) is less than or equal to 
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n=0 \ T 2n< t < T 2n+l Ar2 ( T+e ^ / 
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sup T r n (t) V 


sup 


r 2 r ' n <t<T2;*Ar 2 (T+e 4 ) 


r r n(t) 


x i(r 2 ; < 00 ) + 


sup F n (t) 

. T 2»<*<' r 2n+l Ar2 ( T + e 4) 


t 2 ™ Ar 2 ( T + £ 4 )< 4 <' r 2 n+l Ar2 ( T + £ 4 ) 

H T 2 n = 00 ) > re, r r 2n < r 2 T j, 


( 88 ) 


where 64 > 0 is an arbitrary constant introduced to cover the time instant r 2 T. For completeness, we 
define sup tg 0 X(t) = 0 for any X G D. By definition (cf. (57)), t 2 * = 00 implies that down 2 interval 
does not exist within [ r 2 n> r 2 n+l)> thus buffer 5 never becomes empty during the corresponding 
down interval. Hence, server 4 does not work on buffer 4 during the same interval and the down 
interval ends with the first service completion in server 3. Therefore, when t 2 * = 00 , Q\{t ) = Q^(t) 
and rjj(t) = 0 for all t G \r 2n , T 2n+ \ ) by ( 86 ) and the fact that Q\(t) < W[(t) for all t > 0 (cf. 
(17)). Moreover, when t 2 * < 00 , by the same logic, we have Q\{t) = Q^(t) and rjj(t) = 0 for all 
t G \r 2n , T 2n ]. Therefore, the only nonzero term in ( 88 ) is the one associated with the down 2 interval 

( r 2n > r 2n+l)> and ( 88 ) is eC l Ual to 


E p 

n =0 


sup F n (t) 

T 2n Ar2 ( T + £ 4) <t<Tj„ +1 Ar 2 (T +£4 ) 




< 00 ) > re, r 2n < r 2 T 


(89) 
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From the preceding argument, it is enough to show that the term in (89) converges to 0 as r —>• oo. 
The term in (89) is equal to 


£ p 


sup 


=0 V |_ r 2n < t<r 2n+l Ar2 ( T + £ 4) 


Kit) 
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= 0 \ T 2n < t <' r 2n+l Ar2 ( T + £ 4) 
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=0 \r a r ;* <t<T- n+1 Ar^T+e 4 ) 


sup Q r 3 (t) A Wl(t) - Ql(t) > re, < r 2 (T + e 4 ) 


m 
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E p 
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T 2n < t<r 2 T 'n+l Ar2 ( T + e 4) ^B 


Ql(t)>re, r 2 t<r 2 (T + e 4 ) , 


n=0 \ r 2 n <t<r 2 n+l 


(90) 

(91) 


where (90) is by (17), ( 86 ), and the fact that Q 4 (t) < Q^[t) for all t S (Kn ■ T ' 2 n+i ) by construction; 
and (91) is by (17). Let £5 := inf r > n (fi r R / j_i r A )e for some r\ > 0. By Assumption 3, £5 > 0 when r 4 
is sufficiently large. Then, when r > r 4 , the sum in (91) is less than or equal to 


^ ( 

P SU P QEW > re 5 , r 2 ;* < r 2 (T + e 4 ) 

n=0 \ T 2n< t < T 2u+l Ar2 (. T +^) 


(92) 


Note that, when r 2 n < oo, Qfj( T 2 n ) = 0 f° r all n G N by construction. 

By Assumption 3 Parts 1, 2, and 3, /i^ > \ r b for r sufficiently large. Since server 4 gives priority to 
buffer 5 during a down 2 interval, each probability within the sum in (92) resembles the probability 
that the supremum of the buffer length in a light traffic GI/GI/1 queue is at least 0(r) within a 
down 2 interval. Moreover, there are 0(r 2 ) probabilities in the sum in (92). Therefore, the key to the 
proof is to first construct a specific light traffic GI / GI /1 queue and then bound each probability in 
the sum in (92) by the probability that the supremum of the buffer length process in the constructed 
GI/GI/1 queue is greater than res within a compact time interval with length 0(r 2 ). Then we 
obtain the desired convergence result by Proposition 5. 

8.2.1 Construction of the light traffic GI/GI/1 queue 

In this section, we construct a light traffic GI/GI/1 queue to bound the sum in (92). This is doable 
because by Assumption 3 Parts 1, 2, and 3, A & < /.ib■ However, the approach is different depending 
on if server 2 is in light traffic (A b < ^ 2 ) or heavy traffic (A& = / 1 2 ); see the case 1 and case 2 below. 

When server 2 is in light traffic, the assumption that server 2 has instantaneous processing 
capacity allows us to bound Q 5 over (T 2 ^*,r 2ri+1 A r 2 (T + e 4 )) for each n by a light traffic GI/GI/1 
queue with arrival rate A&, service rate hb, and initial buffer length Q^i^n)’ which is not too large. 
The size of the initial buffer length cannot be controlled when server 2 is in heavy traffic. In that 
case, since \b = ^2 < Hb, the light traffic GI/GI/1 queue bound is obtained by pretending there is 
an infinite number of jobs in front of server 2.(The reason why this approach does not work when 
server 2 is in light traffic is that /^2 > /r_B is possible.) 

Case 1: Server 2 is in light traffic In this section, we prove that the sum in (92) converges 
to 0 as r —> 00 under the assumption that server 2 is in light traffic (A& < /i 2 )• Let, N 3 := 
[(A)) + [x r B )r 2 (T + e 4 )"|, k > 0 be an arbitrary constant such that k < /12 A [13 ~ A b, and 


£n ,2 := {Sb(Ib(t£-)) < JVJ, SUt£) < iVJ, Q’ 2 (t£) < ((,.2 Aw-«-A,)A €s) r/2} . (93) 


28 




Then the sum in (92) is less than or equal to 


Ag 




sup 


=0 \' r 2n< t <' r 2n+l Ar2 ( T + £ 4) 


Qm>™ 5 , r 2 ^<r 2 (T + e 4 ), C 2 


Ag 


+ £P(g„* <rHT + ti), (e r „,2r) 


n =0 


(94) 

(95) 


The term in (95) converges to 0 because Q 2 is a light traffic GI/GI/1 queue, and the number of 
external type b job arrivals and service completions associated with activity B in an interval of length 
0(r 2 ) can be bounded with high probability using the rate of the renewal processes. We have the 
following result whose proof is presented in Appendix B.5. 


Lemma 4 As r -> oo, En=o P ( T 2 n <r 2 (T + e 4 ), (£^ 2 )°) °- 


To complete the proof of Proposition 4 when server 2 is in light traffic, it remains only to show 
the sum in (94) converges to 0 as r —> oo. To do this, we first couple with a hypothetical single 
server queue in which type b jobs are given priority even beyond time T 2n+1 , so that the random 
time at the end of the interval (t^*) r 2 n+i A r 2 (T + e 4 )) can be ignored. Next, we assume that the 
processing at server 2 is instantaneous, and that the first arrival which occurs after time t 2 * occurs 
instantaneously, in order to create an upper bound light traffic GI/GI/1 queue with arrival rate A r b 
and service rate [i r B . Finally, we show that the upper bound light traffic GI/GI/1 queue converges 
to 0 as r — > oo by Proposition 5. 

By sample-path arguments, we will construct an upper bound (depending on r) to the sum in 
(94) and show that this upper bound converges to 0 as r —> oo. Let us define a new hypothetical 
queueing system in which we use the same control until r 2n , 1 but after this time epoch, server 4 
always gives static preemptive priority to buffer 5 over buffer 4. We call this hypothetical system as 
“system 1". Let the queue length process for buffer k, k £ 1C be Q/ in system 1. Then, clearly, 
for each uj £ bl, Q^’ r (ui t ) = Q$(ujt) for all t < t 2ti+1 (uj ) and the sum in (94) is less than or equal to 


Ag 


n =0 
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K T 2n <t<r 2 (T+eA 


Qs ),r (t) > re 5 , t 2 * < r z {T + e 4 ), 8 r n2 


EEE P r . SU 2 P 

n =0 i=0 j =0 V t 2 ; <t<r 2 (T+ei) 


Q 5 ],r {t) > re 5 , t 2 * < r 2 (T + e 4 ), 


Sb(Ib(t£,*)) = i, SI(tS) = j, Q ( A r (rS) < 


((^2 A HB - K - h) A e 5 ) r 


(96) 


where (96) is by (93). 

Next, let us construct another hypothetical queueing system, which we call “system 2”, by mod- 

12) r 

ifying system 1 in the following way. Let the queue length process for buffer k, k £ /C be Q y k ’ 
in system 2. For each uj £ such that t 2 *{uj) < r 2 (T + e 4 ), server 2 can process jobs instanta¬ 
neously after t 2 *(oj). Furthermore, assume that the next arrival after time t 2 *, which occurs at 
time V//{S[{t 2 *) + 1) occurs instead at time t 2 *. This implies that when t 2 *(uj) < r 2 (T + e 4 ), 
Q^’ r (ujt) = Q^’ r (ujt) for all t < t 2 *{uj ); and Q^ ,r {ujt) > Q^’ r (ujt) for all t > t 2 *(uj). Note that, 
Q^’ r ( r 2 n = 0 on the set {t 2 * < oo} and Q^' r (t 2 *{uj)) = Q^ 2 ,r (t 2 * (uj)) + 1 on the same set 
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because server 2 depletes buffer 2 instantaneously starting from t/’* (u) and the next arrival occurs 
immediately in the system 2. 

In summary, after r// , buffer 5 behaves like a GI/GI /1 queue with initial buffer length less than 
or equal to ((/i 2 A fi B — k — Aft) Aes)r/2 + l, service times 

{vUSUm t")) +k),ke N + } , (97) 


and interarrival times 

KSffl+ltO, !6N+}. (98) 

Since there is a service completion related to activity B at time t^’*, the first service time in (97) is 
V R(Sr )) + 1- Then the sum in (96) is less than or equal to 


N r 2 N$ iVg 

EEEP 

n= 07—0 j =0 


sup 

r 2n <t<r 2 (T+e 4 ) 


Q?’ r {t) > re 5 , < r\T + e 4 ), S r B (T r B (r £)) = i, SJ(r£) = j 


(99) 


given that (t 2 *) < ((^ 2 A - ac - A fe ) A e 5 ) r/2 + 1 . 

At this point we use the i.i.d. property of the service and interarrival times. For all i,j £ 
{0,1,... , iVg}, the interarrival times are equal to {u£(/), l £ N_|_} in distribution, and service times 
are equal to { v r B (k ), k £ N + } in distribution by (97) and (98) in (99). Then, let us construct a 
hypothetical GI/GI /1 queue, where the buffer length process is denoted by Q^’ r , the interarrival 
and service time sequences are l € N + } and { v B (k ), k £ N+}, respectively, and Q^’ r (0) = 

((n 2 A fi B — k — A b) A 65 ) r/2 + 1. It is possible to see that the sum in (99) is less than or equal to 


AJ(A^) 2 P 


sup 

\0<A<r 2 (T+e 4 ) 


Q ( 5 ) ' 1 '( t ) > re 5 


( 100 ) 


given that Q^ ,r (0) = ((// 2 A / j, b - At - Aft) A £5) r/2 + 1. 

Lastly, let us construct a hypothetical GI/GI /1 queue, which we call “system 4” and where the 
buffer length process is denoted by qI^' . In system 4, the interarrival and service time processes are 
{v r b (l){\ r b /(\b + k/2)), l € N + } and {v r B (k)(n B /(n B - ac/2)), k £ N+}, respectively 4 , and (^(O) = 
((/i 2 A [i B — k — Aft) A £ 5 ) r/2 + 1. Hence, the arrival and service rates in the hypothetical GI/GI /1 
queue in system 4 are Aft + At/2 and p i B — ac/2, respectively. By Assumption 3 Parts 1 and 2, there 
exists an r 2 > 1 such that if r > r 2 , the term in (100) is less than or equal to 

N r 2 (IV 3 r ) 2 P ( sup g? ) (t)>re 5 ], (101) 

\0 <A<r 2 (T+e 4 ) J 


given that Q b (0) = ((/i 2 A [i B — At — Aft) A £ 5 ) r/2 + 1. The term in (101) converges to 0 as r -> 00 
by the fact the corresponding server is in light traffic and Proposition 5. 

Case 2: Server 2 is in heavy traffic In this section, we prove that the sum in (92) converges to 
0 as r -> 00 under the assumption that server 2 is in heavy traffic (Aft = /i 2 ). Now, Lemma 4 does 
not hold because we cannot show that the event {(^(t^*) < e 6 r ) T 2 n — r2 (T + £ 4 )} occur with high 

4 Since v b (l)(A b /(A b + At/2)) = v b (l)/(A b + At/2) and Vg(l)(fi r B /([iB — ac/2)) = vb{1)/{^b - At/2) for all l £ N+ (cf. 
Section 3.1), the corresponding sequences are independent of r 
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probability for any > 0 in this case. However, we can solve this problem by modifying the part of 
the proofs starting from (93) in the following way. First, we modify (93) by 


^,2 := {^(T^(r^)) < 7V.() . (102) 

Then, again, the sum in (92) is less than or equal to the sum of the terms in (94) and (95), and 
(95) converges to 0 as r —> oo by the convergence of the sum in (201) to 0 (cf. proof of Lemma 4 in 
Appendix B.5, specifically see (204)). 

We again use sample-path arguments to construct an upper bound (depending on r) to the sum 
in (94). First, let us construct the hypothetical system 1 as before. By (3b) and the fact that 
Q ( 5 1 ),r ( T 2 n) = 0 on the set {r 2 ;* < oo}, 

S r B (T r B (T r 2 ’ n *)) = S5CZ!(t£)), (103) 

on the set {t 2 * < oo}. By (102) and (103), the sum in (94) is less than or equal to 


> re 5 , r 2 ;* < r 2 (T + e 4 ), S r B (T r B (r 2 ;*)) = S t 2 {T^(t^)) = i J , 


p ( su p 

n=0 i=0 \ r 2n< t < r ' 2 ( T + e ^) / 

(104) 

given that Q^ ,r {t 2 *) = 0. Next, let us construct the hypothetical system 2 in the following way. 
We assume that at time t 2 *, all remaining type b jobs arrive immediately to buffer 2. Then, server 
2 never becomes idle after time t 2 *. Furthermore, assume that the next service completion in server 
2 after time t 2 * , which occurs at time V 2 {S 2 {T 2 (t 2 *)) + 1) occurs instead at time t 2 * . This implies 
that when t 2 ’*(u >) < r 2 (T + e 4 ), Q^ ,r {uj t ) = Q^’ r (u t ) for all t < t 2 *(u ;); and Q ( f >,r {u t ) > Q^’ r (uit) 
for all t > t 2 *{oj). Note that, Q^’ 7 (t 2 *(uj)) = 0 on the set {t 2 * < oo} and Q^’ r ( t 2 *(uj)) = 1 on 
the same set because the next service completion in server 2 after t 2 * occurs immediately in the 
system 2. 

In summary, after r 2n , buffer 5 behaves like a GI/GI/1 queue with initial buffer level 1, service 
times 

{»b(Sb(IS(t£*)) + *),*£ N + } , (105) 

and interarrival times 

WfSSCU( O) +1 + 0 . ‘ € N+} . (106) 

Then the sum in (104) is less than or equal to 

^ ^ / \ 

sup Q < 5 ),r ( t ) > re 5 , r 2 r n * < r 2 (T + e 4 ), S b (T b (t 2 *)) = S%(T£(t 2 ’*)) = i I, 

n=0 i=0 V r 2 n <t<r 2 (T+e 4 ) J 

(107) 

given that Qf ),r (r 2 *) = 1 . 

At this point we use the i.i.d. property of the service and interarrival times. For all i € 
{ 0 , 1 ,... , Alg} the interarrival times are equal to {v 2 (l), l £ N+} in distribution, and service times 
are equal to {v B (k), k £ N + } in distribution by (105) and (106) in (107). Then, let us construct a 
hypothetical GI/GI/1 queue, where the buffer length process is denoted by Q 5 3 '*’ r , the interarrival 
and service time sequences are {v 2 (l), l £ N + } and {v B (k), k £ N + }, respectively, and Q^’^O) = 1. 
Then, it is possible to see that the sum in (107) is less than or equal to 


N 2 N$P 
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(108) 
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given that Q®’ r ( 0 ) = 1 . 

Lastly, let us construct a hypothetical GI/GI/1 queue, which we call “system 4” and where 
the buffer length process is denoted by Q^ ■ Let us modify the definition of k > 0 such that k < 
/Lb — a *2 - In system 4, the interarrival and service time processes are {v^ (OO -^/(/*2 + ac/2)), l G N+} 
and {v r B {k){n r B /(nB ~ k/2)), k G N + }, respectively, and (0) = 1. Hence, the arrival and service 
rates in the hypothetical GI/GI/1 queue in system 4 are fi 2 + k/2 and hb — k/2, respectively. By 
Assumption 3 Parts 1 and 2 , there exists an r% > 1 such that if r > 7 * 3 , the term in (108) is less than 
or equal to 

N 2 i N 3 ) 2 p ( sup Q< 4 ) (f) >re 5 ) , (109) 

\o<t<r 2 (r+ e 4) / 

given that Q !- 4 (0) = 1. The term in (109) converges to 0 as r -> 00 by the fact the corresponding 
server is in light traffic and Proposition 5. 

Remark 9 We assume that Q r k ( 0) = 0 for all k G K, up to now. However, it is straightforward to 
see that the results presented so far hold under the following weaker assumption. 

Assumption 4 For each r, (Q k (0),k G /C) is a nonnegative random vector which takes values in 
N 10 such that 

1. QS( 0 ) = Ql( 0 ), Q'( 0) = Q§(0), Q r 5 (0) = QUO), and Q r 9 ( 0) = QI 0 (0) for each r. 

2. r~ 1 Q k ( 0) => 0 and r~ 2 Q k (0) > 0 for all k G 1C as r -» 00 , 

3. If server 2 is in light traffic, i.e., \ < P 2 , then for each e > 0, there exists a k G (0, ^2 A//b — At) 

and r 0 > 1 such that if r > r 9 , 

P (Qa(0) > ((/^2 A p B ~ k ~ A&) A e) r/2) < C 0 r 3 e~ Cir , 

for some constants Cq > 0 and C\ > 0 which are independent of r. 

Assumption 4 Part ( 1 ) guarantees that the initial buffer lengths do not violate (11) at time t = 0. 
We need Assumption 4 Part ( 2 ) because of two main reasons. First, since we consider cases in which 
some of the servers are in light traffic, in order to obtain the weak convergence of the diffusion scaled 

queue length processes corresponding to these servers to 0, we need the first convergence result in 

Assumption 4 Part ( 2 ). Second, in order to obtain the a.s. convergence of the fluid scaled queue 
length processes in each buffer to 0 u.o.c., we need the second convergence result in Assumption 4 
Part (2). We need Part (3) of this assumption in order to invoke Proposition 5 in the proof of 
Lemma 4, specifically in (206). 

Remark 10 It is possible to weaken the exponential moment assumption (cf. Assumption 2) for 
some of the interarrival and service time processes. First, in Section 8.1.1, we use the exponential 
moment assumption only for the service time processes associated with activities 3 and A in the proof 
of Lemma 3. However, we can relax this assumption in the following way. We need 4/(l + a) < 7 < 1 
(cf. proof of Lemma 3). Since 7 G (0,1) can be chosen arbitrarily close to 1, any a strictly greater 
than 3 satisfies 4/(1 + a) < 7 < 1. Since we need E[(tA(l)) 2+2 “] < 00 for j G {3, A} in the proof of 
Lemma 3 (cf. (198) and (200)/, E[(rA(l)) 8+/3 ] < 00 , j G {3, A} for some /3 > 0 is sufficient to get 
the convergence result associated with (55). Second, in Section 8.2.1, we have used the exponential 
moment assumption for the interarrival times of type b jobs (cf. ( 101 ) and (206), which is in the 
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proof of Lemma 4), service times associated with activity 2 (cf. (109) and ( 206 )), and service times 
associated with activity B (cf. (101) and (109) J. For the interarrival times of type a jobs and the 
service times associated with activities 1, 5, 6, and 7, we only need finite second moment assumption 
to use the FCLT in Sections 7.1 and 7.2. 

9 Simulation 

In this Section, we use discrete-event simulation to test the performance of the non-preemptive 
version of the proposed policy which is described in Remark 1. We consider 36 different test instances 
and at each instance, we compare the performance of the non-preemptive version of the proposed 
policy with the performances of 4 other non-preemptive control policies. The instances are designed 
to consider various cases related to the processing capacities of the servers and variability in the 
service times. We describe the simulation setup in Section 9.1, then we present the results of the 
experiments in Section 9.2. 

9.1 Simulation Setup 

At each instance, the arrival processes of type a and b jobs are independent Poisson processes with 
rate one, thus A a = Aj, = 1°. At each instance, servers 5, 6, and 7 are in either heavy or light traffic, 
where heavy and light traffic mean 95% and 70% long-run utilization rates of the corresponding 
server, respectively. For example, when server 6 is in heavy (light) traffic at an instance, then 
/ig = 1/0.95 (/tg = 1/0.7), which gives the desired long-run utilization rate. At each instance, servers 
1 and 2 are in light traffic and server 4 is in heavy traffic such that pa = Pb = 2/0.95. At each 
instance, we assume that p^ £ {1/0.95,1/0.7125,1/0.35} to test the performance of the proposed 
policy when A a « p 3 , A a < P 3 < PA, and p 3 > p a, respectively. 

We use three different distribution types for the service time processes, namely Erlang-3, Ex¬ 
ponential, and Gamma distributions. When the service time process associated with an activity 
is Gamma distributed, we choose the distribution parameters such that the squared coefficient of 
variation of the corresponding service time process is 3. Note that the squared coefficient of vari¬ 
ations in Erlang-3 and Exponential distributions are 1/3 and 1, respectively. Therefore, Erlang-3, 
Exponential, and Gamma distributions correspond to the low, moderate, and high level variability 
in the service time processes, respectively. At each instance, we use the same distribution type for 
all service time processes. 

Table 1 shows the parameter choices related to the service time processes at each instance. For 
example, at instance 7, p% = 1/0.7125, server 5 is in heavy traffic, and servers 6 and 7 are in light 
traffic. Moreover, the service time processes associated with each activity are Erlang-3 distributed 
at this instance. In the first 18 instances, the downstream servers, i.e. servers 6 and 7, are in light 
traffic. Note that we prove the asymptotic optimality of the proposed policy under this assumption 
(cf. Assumption 3 Part 6). However, in order to test the robustness of the proposed policy, we 
consider the cases in which server 6 or 7 is in heavy traffic in the last 18 instances. 

Even though Wf weakly converges to the same limit independent of the control (cf. Proposition 
2), W'[ is policy dependent in the pre-limit. Since our class of admissible scheduling policies is 
very large, and includes the ones that can anticipate the future, we cannot construct a pre-limit 
lower bound based on the solution of DCP (30) given in Proposition 3. Hence, there is no pre¬ 
limit lower bound on performance. Furthermore, we cannot use the DCP solution to develop an 
approximate pre-limit lower bound on the average holding cost, because that requires knowing the 
stationary distribution of the relevant SRBM and that is only straightforward to find under very 

5 For convenience in notation, we have dropped the superscript r on the parameters. The reader should understand 
that each set of parameters (A a , \b, pi, P 2 , ■ ■ ■, pi) is associated with a particular r, and is not the limit parameter that 
appear in Assumption 3. 
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Table 1: Parameter choices at each instance. 


M3 

M5 

M6 

Mr 

Service Time Variability 

Instance 

1/0.95 

1/0.7125 

1/0.35 

1/0.95 

1/0.7 

1/0.95 

1/0.7 

1/0.95 

1/0.7 

Low 

Moderate High 

1 

X 



X 



X 


X 

X 


2 

X 



X 



X 


X 


X 

3 

X 



X 



X 


X 


X 

4 

X 




X 


X 


X 

X 


5 

X 




X 


X 


X 


X 

6 

X 
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X 
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X 
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X 
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X 
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X 
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X 
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10 
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specific conditions (cf. Harrison and Williams [1987], Dai and Harrison [1992], Dieker [2011], Dai 
et al. [2014]). The relevant SRBM does not have a product form stationary distribution in our case 
(cf. Harrison and Williams [1987]), when it is at least two-dimensional. Therefore, we compare 
the performances of 5 different non-preemptive control policies. The first one is the non-preemptive 
version of the proposed policy (see Remark 1 for the definition). Since the proposed policy is the 
SDP policy and the static priority policy when \ a < P 3 < ha and ^3 > ha, respectively, the second 
and the third control policies that we consider are the non-preemptive versions of the SDP and the 
static priority policies, respectively. The fourth policy that we consider is the FCFS policy, in which 
whenever server 4 is ready to process a new job, it chooses the job which has arrived the earliest 
to the buffers 4 or 5. The fifth policy that we consider is the randomized policy , in which whenever 
server 4 is ready to process a job and if both of the buffers 4 and 5 are non-empty, server 4 chooses 
a type a job with half probability. If only one of the buffers 4 and 5 is non-empty, then server 4 
chooses the job from the non-empty one. 

We have used Omnet++ discrete-event simulation freeware in our experiments. At each instance 
associated with each control policy that we considered, we have done 30 replications. At each 
replication, we have created approximately 1 million type a jobs and 1 million type b jobs and we 
have considered the time interval in which the first 50, 000 type a jobs and the first 50,000 type b 
jobs arrive as the warm-up period. 

9.2 Simulation Results 

In this section, we present the results of the simulation experiments. We assume that hb = 1 in 
all experiments and consider various h a values such that h a > hb- In each experiment, we use the 
same h a value at all instances. 

Let V denote the set of the five control policies that we consider in the simulation experiments 
and Ql’ J (i) denote the average length of buffer k, k £ /C in replication j, j £ { 1 , 2 ,... ,30} with 
respect to policy p, p £ V at instance i, i £ {1,2,... ,36}. Then, Q^(i) := (1/30) Q P k\^) is 
the average length of buffer k corresponding to policy p at instance i. Note that, we have proved 
the asymptotic optimality of the proposed policy with respect to the objectives (7), ( 8 ), and (12) 
but not with respect to the average cost objective (9) (cf. Remark 5). Still, our results suggest that 
our proposed policy performs well with respect to (9), which is a natural objective to consider in 
simulation experiments. Hence, we use the objective (9). Recalling the equality (11), we only need 
to compute J p (i) := h a (Q^(i) + Q^(i)) + hb(Qg(i) + Qi 0 (i)), which is the average total holding cost in 
buffers 3, 6 , 7, and 10 per unit time corresponding to policy p at instance i. Let L(i) := min pe -p J p (i) 
denote the lowest average cost among the policies in V at instance i. 

We present the detailed results of the simulation experiments in Table 4, which is in Appendix 
C. This table contains Q%(i) + Q^(i) and Q§(i) + Q\o{i) for each i and p with their 95% confidence 
intervals. 

Table 2 shows the average and maximum deviations of the cost of the policies from the lowest 
realized average costs among the first and last 18 instances for different h a values. For example, for 
given h a , the “Avg.” and “Max.” columns corresponding to policy p and the first 18 instances are 


1 

18 


E 


i =1 


^100 X 


J p (i) - L{i) \ 

L(i) J ’ 


max 

ie{l,2,...,18} 


100 x 


J p (i) - L(i) 

L(i ) 


respectively. In the “All” row, “Avg.” (“Max.”) column corresponding to policy p denotes the 
average (maximum) of the values in the “Avg.” (“Max.”) column among all h a values. 

According to the results, the proposed policy in general performs the best with respect to both 
the average and maximum deviations from the lowest realized average cost. The SDP policy performs 
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Table 2: Average and maximum deviations of the cost of the policies from the lowest realized average 
cost. 


First 18 instances 


Proposed 

SDP 

Static 

FCFS 

Randomized 

ha 

Avg. 

Max. 

Avg. 

Max. 

Avg. 

Max. 

Avg. 

Max. 

Avg. 

Max. 

1 

1.6% 

6.4% 

1.8% 

6.5% 

9.1% 

30.7% 

3.7% 

9.5% 

4.1% 

8.7% 

1.25 

0.0% 

0.3% 

0.3% 

1.4% 

6.2% 

25.8% 

6.2% 

15.3% 

6.7% 

14.6% 

1.5 

0.1% 

0.8% 

0.5% 

1.7% 

5.3% 

22.2% 

10.0% 

20.7% 

10.6% 

20.2% 

1.75 

0.2% 

1.4% 

0.7% 

2.0% 

4.6% 

19.3% 

13.6% 

25.9% 

14.2% 

25.4% 

2 

0.3% 

2.0% 

0.8% 

2.3% 

4.0% 

17.1% 

16.9% 

30.8% 

17.5% 

30.4% 

3 

0.8% 

4.1% 

1.6% 

4.1% 

2.5% 

11.3% 

28.5% 

48.0% 

29.2% 

48.0% 

4 

1.3% 

5.8% 

2.5% 

5.8% 

1.8% 

8.4% 

37.8% 

65.3% 

38.6% 

65.5% 

10 

3.1% 

11.6% 

5.2% 

11.6% 

0.4% 

2.8% 

68.9% 

125.1% 

70.1% 

123.3% 

All 

0.9% 

11.6% 

1.7% 

11.6% 

4.2% 

30.7% 

23.2% 

125.1% 

23.9% 

123.3% 


Last 18 instances 


1 

4.1% 

17.0% 

4.1% 

16.1% 

9.5% 

25.6% 

0.8% 

5.2% 

1.6% 

6.5% 

1.25 

2.3% 

11.4% 

2.4% 

11.4% 

6.9% 

22.1% 

1.6% 

8.4% 

2.4% 

10.1% 

1.5 

1.4% 

7.4% 

1.5% 

7.4% 

5.3% 

19.3% 

2.9% 

11.1% 

3.8% 

13.0% 

1.75 

0.7% 

3.8% 

0.8% 

3.8% 

4.0% 

17.3% 

4.1% 

13.3% 

5.0% 

15.5% 

2 

0.2% 

1.4% 

0.4% 

1.4% 

3.2% 

15.8% 

5.3% 

15.3% 

6.2% 

17.7% 

3 

0.4% 

2.0% 

0.7% 

2.0% 

2.1% 

11.5% 

10.9% 

24.8% 

11.8% 

26.1% 

4 

0.6% 

2.5% 

1.1% 

2.5% 

1.6% 

8.8% 

15.2% 

35.9% 

16.2% 

37.3% 

10 

1.4% 

7.0% 

2.3% 

7.0% 

0.4% 

2.5% 

29.2% 

81.9% 

30.3% 

84.0% 

All 

1.4% 

17.0% 

1.7% 

16.1% 

4.1% 

25.6% 

8.7% 

81.9% 

9.7% 

84.0% 


very close to the proposed policy, which shows that it performs well even when p% > hA- As expected, 
the performance of the static priority policy becomes better as h a increases, and it performs the best 
when h a is much larger than hi,. Performances of the FCFS and the randomized policies become 
worse as h a increases, which is expected because these policies do not give more priority to type a 
jobs than type b jobs. On average, these two policies perform the worst with respect to both the 
average and maximum deviations from the lowest realized average cost. 

At the first 18 instances, when h a = 10, the static priority and the proposed policies perform 
the best and the second best, respectively. This is not surprising given that in all of the instances 
in which server 3 is in light traffic, both the static priority and the SDP policies are asymptotically 
optimal (cf. Remarks 4 and 6). The superior performance of the static priority policy in the pre-limit 
can be attributed to the high holding cost of type a jobs. 

At the last 18 instances, the proposed policy still performs the best in average, which suggests 
that the performance of the proposed policy is robust with respect to the processing capacities of 
the downstream servers. 

Another interesting result that we see from Table 2 is that the percentage deviation of the worst 
performing policy from the lowest realized average cost at the first 18 instances is much higher 
than the same deviation at the last 18 instances. This result is due to the fact that at least one 
of the downstream servers is in heavy traffic at the last 18 instances and this decreases the waiting 
time of the jobs for the ones that they are going to be matched in the join servers. In other words, 
synchronization requirements between the jobs in different buffers become less important in this case. 
Hence, the effect of the control policy on the system performance is less important when at least one 
of the downstream servers is in heavy traffic, which justifies Assumption 3 Part 6. Moreover, FCFS 
policy performs the best when h a = 1 at the last 18 instances because all jobs are equally expensive 
and this policy approximately matches the arrival time of the same type of jobs to the downstream 
buffers. 

Figure 5 shows 100 x ( J p (i ) — L{i))/L(i) for each policy p, p € V at each i, i £ {1,2,... , 18} 
when h a = 2 and hi, = 1. Since pa = Pb and h a = 2 hi,, it makes more sense to give more priority to 
type a jobs. This is why in addition to the policies in V, we also consider a randomized- 2/3 policy , in 
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which whenever server 4 becomes available and if both of the buffers 4 and 5 are non-empty, server 

4 chooses a type a job with 2/3 probability. If only one of the buffers 4 and 5 is non-empty, then 
server 4 chooses the job from the non-empty one. 

In Figure 5a, we see that the proposed and the SDP policies perform well at each instance. SDP 
policy performs well even at the instances {13,14,... , 18}, where ^3 > ha- However, the static 
priority policy does not perform well in the first 6 instances, where server 3 is in heavy traffic. This 
result is expected because when server 4 gives static priority to buffer 4 under the condition that 
server 3 is in heavy traffic, jobs in buffer 8 will wait for the jobs in buffer 7. In this case, it is 
more efficient to give priority to the type b jobs in server 4 and this is exactly what the SDP policy 
does. The static priority policy performs the best at the instances {13,14,... , 18}, where ^3 > ha- 
Moreover, its performance is close to the one of the SDP policy at the instances {7, 8 ,... , 12}, where 
A a < AD < HA- This result is not surprising because static priority policy is asymptotically optimal 
when server 3 is in light traffic (cf. Remark 6 ). On the one hand, at the instances {7, 8 , 9}, the static 
priority policy performs better than the SDP policy because server 5 is in heavy traffic at these 
instances, hence server 4 should not give priority to the type b jobs at all. On the other hand, at 
the instances {10,11,12}, SDP policy performs better than the static priority policy, because server 

5 is in light traffic at these instances; hence giving priority to type b jobs in server 4 decreases the 
waiting time of the jobs in buffer 10 . 



16 12 18 

Instance number 

(a) Proposed, SDP, and static policies. 



Instance number 

(b) Proposed, FCFS, rand., and rand.-2/3 policies. 


Figure 5: (Color online) Percentage deviations of the costs of the policies from the lowest realized 
average cost (L(i) for j e {1,2,..., 18}) in the first 18 instances when h a = 2 and hb = 1. 


Figure 5b shows that the FCFS and the randomized policies perform worse than the proposed 
policy at each instance. The randomized-2/3 policy performs well only at the instances {7, 8 ,... , 12}. 
Figure 6 , where we compare the static priority, randomized, and randomized-2/3 policies, explains 
this result in the following way. On the one hand, at the first 6 instances, we see that the random¬ 
ized and randomized-2/3 policies perform the best and the second best among these three policies, 
respectively. This implies that when server 3 is in heavy traffic, giving more priority to type a jobs 
decreases the performance. On the other hand, at the instances {7, 8 ,... , 18}, the static priority and 
randomized-2/3 policies perform the best and the second best among these three policies, respec¬ 
tively. This implies that when server 3 is in light traffic, giving more priority to type a jobs increases 
the performance. In summary, the processing capacity of server 3 affects the performance of the con- 
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trol policy significantly. This is consistent with our theoretic results showing that an asymptotically 
optimal policy should not give static priority to type a jobs when server 3 is in heavy traffic. 



Instance number 

Figure 6: (Color online) Percentage deviations of the costs of the static, randomized, and the 
randomized-2/3 policies from the lowest realized average cost (L(i) for i £ {1, 2,... , 18}) in the 
first 18 instances when h a = 2 and hi, = 1. 


Remark 11 Figure 5a shows that the SDP policy performs very well at the instances {13,14,... , 18} 
where H 3 > /a a- Moreover, Table 2 shows that the SDP policy performs close to the proposed policy 
at all cases. Hence, we conjecture that the SDP policy is asymptotically optimal even when H 3 > ha- 
When fj ,3 > ha, the proofs associated with the up intervals (cf. Section 8.1) do not work. Note 
that we consider the difference of two renewal processes with rates ha and H 3 , respectively in the 
proofs associated with the up intervals (cf. (63)}. We show that the renewal process with rate ha 
stays sufficiently close to the one with rate H 3 during the up intervals when H 3 < TA- However, this 
argument does not hold when H3 > PA- 


10 Extensions 

It is possible to extend our results to more complex networks. We first consider a fork-join 
network in which there are task dependent holding costs in Section 10.1. Next, we consider a fork- 
join network in which jobs fork into an arbitrary number of jobs in Section 10.2. Lastly, we consider 
fork-join networks with more than two job types in Section 10.3. In each of these extensions, we first 
construct and solve an approximating DCP, then interpret a control policy from the solution. Since 
we omit the asymptotic optimality proofs corresponding to these extensions, the suggested control 
policies are all heuristics. 


10.1 Task Dependent Holding Costs 

So far, we assume that the holding cost rate of a type a ( b ) job in buffers 1, 3, 4, 7, and 8 (2, 5, 
6 , 9, and 10) are the same and denoted by h a (hi,). In this section, we extend this assumption by 
considering task dependent holding cost rates. We denote the holding cost rate of a job in buffer k 
as hk for all k £ /C and we assume h^ > 0 for all k £ /C. In this case, our objective in the DCP is to 
minimize 


P 


10 


^ ~2h k Qk(t ) > 


\k=l 



Vi > 0, x > 0. 


( 110 ) 
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Since the control policy in server 4 has no effect on the costs in buffers 1, 2, 3, and 6, the objective 
(110) can be simplified to minimizing 

P (h 4 Q 4 (t) + h 5 Q 5 (t ) + h 7 Q 7 (t) + h 8 Q 8 (t) + h 9 Q 9 (t ) + h w Qio{t ) > , Vt > 0, x > 0. (Ill) 

By first simplifying objective (111) by (28), which is the instantaneous service process assumption 
in the downstream servers, we can modify the DCP (30) in the following way: For each x > 0 and 
t > 0, 

min P (h 4 Q 4 (t) + h 5 Q 5 (t ) + h 7 (q 4 (£) - Q 8 {t)^j + h 8 (Qs{t) - Q 4 (£)) 

+ h 9 (Qe(t) - Q 5 (0) + h w (Qn{t) - > x'j , (112) 

s.t. Q 4 (t) + —Q${t) = W 4 (t), 

h-B 

Qfc(^) > 0, for all k € {4, 5}. 

When we consider the DCP (112) path-wise, we have the following optimization problem for all 
£ € R + and cj in D except a null set: 

min h 4 Q 4 (uj t ) + h 5 Q 5 (uj t ) + h 7 ^Q 4 (w t ) - Q 3 (wt)) + ^8 (^ 3 (<*>*) - Q 4 (w t )) 

+ ^9 ^Qe(wt) — + ^io ^QsO^t) — Qe(wt)^ , (113) 

s.t. Q 4 (u t ) + —Qs{u t ) = W 4 (u t ), 

k’B 

Qki^t) > 0, for all k E {4, 5}. 

We can solve the optimization problem (113) by the following lemma whose proof is presented in 
Appendix B.6. 

Lemma 5 Consider the optimization problem 

min h 4 g 4 + h 5 q 5 + h 7 (q 4 - q 3 ) + + h 8 (g 3 - q 4 ) + + h 9 (g 6 - q 5 ) + + h 10 (q 5 - q e ) + , (114a) 

s.t. q 4 + —(?5 = (114b) 

hB 

q 4 >0, q 5 > 0, (114c) 

where q 4 and q§ are the decision variables and all of the parameters are nonnegative. Then, there 
exists an optimal solution among the four solutions given in Table 3 below. 

Table 3: An optimal solution set for the optimization problem (114). 


Solution # 

<74 

<75 

i 

0 

{Ub/Ua)w4 

2 

<73 A W4 

{ub/ua)(w4 ~ <73) + 

3 

(w 4 - (ua/ub)< 7e) + 

<76 a {ub/ua)w 4 

4 

W 4 

0 
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Remark 12 The optimal solution of Lemma 5 is strictly dependent on the parameters ( 93 , 96 , 1 ^ 4 ), 

i.e., the optimal solution can change as 93 , or q^, or w 4 changes. For example, consider an example 
in which fi a = hb = 1, h 4 = /15 = 1, hs = /iio = 2, = hg = 3, and ( 93 , 96 , tt> 4 ) = (0.01,0,1). Then, 

the objective function values corresponding to the four solutions in Table 3 are (3.02,2.98,3.97,3.97), 
respectively. This implies that an optimal solution is the second solution in Table 3, which is ( 94 , 95 ) = 
(0.01,0.99). Next, we consider the same example but this time (< 73 , q^.w^) = (1,0.01,1), i.e., <73 and 
q§ change. In this case, the objective function values corresponding to the four solutions in Table 3 
are (4.98,1.03,1.02,1.03), respectively. This implies that an optimal solution is the third solution in 
Table 3, which is ( 94 , 95 ) = (0.99,0.01). 


By Lemma 5, for each t € and oj in except a null set, an optimal solution of DCP (113) is 
among the four solutions presented below, which then motivates a control policy. 


1. If 


(^Q i{uit) 1 Qs(<*4)) 


—W 4 (io t ) 

FA 


then server 4 should give static priority to type a jobs. 


(115) 


2. If 

(Q 4 M,Q 5 (w t )) = (q 3 M A^(«t), ^ (wt(u t ) - QsM) + ) , (116) 

then server 4 should use the proposed policy. Note that the solution in (116) is the same as 
the one in Proposition 3. 


3. If 




Q 6 (ujt) A —W 4 (ujt) 
FA 


(117) 


then server 4 should use the proposed policy but this time it should prioritize type b jobs over 
type a jobs. In other words, if /rg < h 5 , server 4 should give static priority to type b jobs; 
otherwise, it should use the SDP policy to pace the departure process of type b jobs from buffer 
5 with the ones from buffer 6 . 


4. If 


(<2 4 (wt), <35(0-4)) = ( Wi ( u t ), 0), 


then server 4 should give static priority to type b jobs. 


(118) 


Since the optimal solution can change as the process (Q 3 (wt), Qe(oJt), changes with time 

(cf. Remark 12), we interpret the following dynamic control policy: Whenever server 4 makes 
a service completion, the system controller computes the objective function values of DCP (113) 
corresponding to the four solutions in (115) - (118) and implements the control policy corresponding 
to the solution with the minimum objective function value until the next service completion epoch 
in server 4. 


10.2 Networks with Arbitrary Number of Forks 

In this section, we consider the network presented in Figure 7, in which type a and b jobs fork 
into g\ + 1 and 92 + 1 number of jobs, respectively, where 91,92 £ N+; and server 4 is the only server 
which processes both job types. 

Parallel with objective (12), we consider the objective of minimizing 

P {ha(g 1 + 1) {Q\jL{t) + Q r D L.{t)^ + hb(g 2 + 1) ^Q^ij(t) + l?£)fl(^)) > *) j (119) 
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Type a jobs ( h a ) 


Type b jobs (h b ) 



Figure 7: A fork-join processing network with two job types and arbitrary number of forks. 


for all t € and x > 0. We assume that servers 6 and 7 are in light traffic, hence we can assume 
that the service processes in these servers are instantaneous as in Section 4. Then, similar to (28), 
we have 


Qn f 


= Ql v 


max 


Qu l 


- Qrrp 


Q r DH = (qI v (max^ Q r u «) - <&*) + ■ ( 12 °) 


From (120), defining h a := h a (gi + 1) and hb '■= hb(g 2 + 1), and the fact that x + (y — x) + = x V y 
for all x,y G M, the objective (119) is equivalent to minimizing 


P (h a (qI^) V f max + h b (qW) v ^. g max > aA , (121) 

for all t € M + and x > 0 . 

Parallel with Proposition 2, we can prove the following weak convergence result under any work- 
conserving control policy: 

(Q r i,Q r 2, Qijl j * £ {!) 2 ,..., g\}i Q^jRij € { 1 , 2 ,... , 32 }, t?x) 

=> (Qi,Q 2, e {1,2,... ,51}, QjjR,j € {1,2,... ,52}, W4) , (122) 

where the limiting process is the zero process for the buffers whose corresponding dedicated server is 
in light traffic and an SRBM for the buffers whose corresponding dedicated server is in heavy traffic 
and the workload process in server 4. Then, by (121), (122), and using the technique that we use to 
derive the DCP (30), we construct the following DCP for this network. For each x > 0 and t > 0, 


min 


P ( h a ( Q A (t) V 


max Q T tl (t) 


+ hb ( Qb{t) V 


max Q T tr (t) 

ie{i,.., 32 } Ui 


> x 
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(123) 


s.t. Q 4 (t) + —Q b {t) = W 4 (t), 

kB 

> 0, for all k G {4, 5}. 


When we consider the DCP (123) path-wise, we have the following optimization problem for all 
t € M+ and w in fi except a null set: 


min 

s.t. 


(qa(uh) v 


max QrrLiut) 


Q 4 (^t) + - — Qbi^t) — W 4 (ut), 

kB 

Qk{iOt ) > 0, for all k G {4, 5}. 




+ h b I Q§{wt) V 


max QjjRUjJt) 


(124a) 

(124b) 

(124c) 


where Qk{^t), k G {4,5} are the decision variables. Note that the objective function (124a) is equal 
to 


h„ ( max QrjL(ut) + h a \ Q±{u t ) - max QrjL(uj t ) 
^6(1, ...,si} 1 ) \ \*e{i,...,gi} 


+ h b ( max Quaint) ) + h b ( Q b {io t ) 
V*e{i,..., 32 } 1 ) \ 


max QrrRiuJt) 

fc{i,..., 92 } Ui 


(125) 


Since all of the buffers U- J . i G {1,... ,g\} and U^, i G {1,..., <72 } are independent of the control, 
minimizing the objective in (125) is equivalent to minimizing the objective (126a) below, thus the 
optimization problem (124) is equivalent to the following one: 


min 

s.t. 


h a ( Qiivt) ~ ( max Q uL (u t ) 

Qi{^t) + = 1 

k-B 

Qk{^t) > 0, for all k G {4, 5}. 




+ h b | 


max Q rj R(uj t ) 

*e{i,...,92} 1 


(126a) 

(126b) 


(126c) 


Without loss of generality, let us assume that h a ^A > h b kB■ Then by Lemma 1, an optimal solution 
of the optimization problem (126) is 


(Q 4 , Q 5 ) 


= I W 4 A 


max Qt,l 

ie{l,...,9i} i 


kB 

kA 


W 4 - 


max Qt,l 
*G{ 1 ,..., 9 i} i 



(127) 


We can interpret the solution (127) in the following way: If the processing capacity of each of the 
servers Li, i G {1, 2,... ,g{\ is greater than or equal to kA, then server 4 should give static priority 
to type a jobs all the time. Otherwise, server 4 should give priority to type a jobs whenever the 
number of jobs in buffer 4 is strictly greater than the maximum of the number of jobs in buffers 
Uj J , i G {1,2,... , <71 }. In other words, server 4 should pace the departure process of type a jobs 
from buffer 4 with the minimum of the ones from the buffers 17- , i G {1,2,... ,g\]. Hence, we see a 
slightly different version of the SDP policy. 

10.3 Networks with More Than Two Job Types 

In this section we consider fork-join networks with more than two job types. We first consider 
a network with an arbitrary number of job types and a single shared server in Section 10.3.1, and 
then consider a network with three job types and two shared servers in Section 10.3.2. 
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10.3.1 Networks with Arbitrary Job Types and a Single Shared Sever 

In this section, we consider the network presented in Figure 8, where there are n job types such that 
n is arbitrary and n > 2. Server 1 is a shared server which processes all job types, whereas all other 
servers process only a single job type. Upon arriving to the system, each job is first processed in a 
server, then it is forked into two jobs which are processed in the shared server and in a dedicated 
server, respectively, lastly the two forked jobs are joined in the corresponding downstream server and 
leave the system. 


Type 1 jobs [h t ) Type 2 jobs ( h 2 ) Type n jobs ( h n ) 

L l*J .- l*J 



Figure 8: (Color online) A fork-join processing network with arbitrary job types and a single shared 
server. 


A type j job can wait in buffers j, ( j , 2), (j, 3), and (j. 4), where buffer j feeds the shared server, 
buffer ( j , 2) feeds the dedicated server, buffer (j, 3) is fed by the dedicated server, and buffer ( j , 4) is 
fed by the shared server (cf. Figure 8). Server 1, the shared server, processes type j jobs with rate 
fij for all j G {1, 2,... , n}. Let hj denote the cost per unit time to hold a type j job in the system. 
Without loss of generality, we assume that h\^i > h 2^2 > ... > h n [i n . We assume that server 1 is 
in heavy traffic and all of the downstream servers are in light traffic. Parallel with (17), let 


3= 1 


Q 

7 


denote the diffusion scaled workload process that server 1 sees. Then, parallel with Proposition 2, 
we can prove the following weak convergence result under any work-conserving control policy: 

(Q) i2 ,je{l,2,...,n}, ? r ) =}► (Qj, 2 , j £ { 1 , 2 ,..., n}, w) , (128) 

where the limiting process is the zero process for the buffers whose corresponding dedicated server is 
in light traffic and an SRBM for the buffers whose corresponding dedicated server is in heavy traffic 
and the workload process in server 1. Since we assume that the downstream servers are in light 
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traffic, we can assume that the service processes in these servers are instantaneous. Then, similar to 
(28), we have 

Qj ,3 = (Q'j ~ Qj , 2 ) + , Qj,4 = (Qj ,2 - Qj) + , Vj €{1,2 ,..., n}. 

By using the technique that we use to derive DCP (30), we construct the following DCP for this 
network. For each x > 0 and t > 0, 


min 



s.t. ^2Qj(t)/nj = W(t), 

3 =1 


Qj(t) > 0, for all j €{1,2 ,..., n }. 


(129) 


When we consider the DCP (129) path-wise, we have the following optimization problem: 

n _ _ _j_ 

min ^(Qj-M-Qi.ate)) , 
i=i 

n 

s.t. y ^Qj(cJt)/nj = W(u t ), (130) 

3 =i 

Qj(u3t) > 0, for all j € {1,2 ,...,n}. 

where Qj(ut), j € {1,2 ,..., n} are the decision variables. Note that, since Qj .2 is an exogenous 
process for all j € {1,2 ,... ,n} (cf. (128)), we neglect the term Y^=i ^jQj^i^t) i n the objective 
function of the optimization problem (130). We can solve the optimization problem (130) by the 
following lemma whose proof is presented in Appendix B.7. 


Lemma 6 Consider the optimization problem 


n 


min h 3 (Qj ~ Qj, s) + j 

j= i 

(131a) 

n 

s.t. y Q 3 / 1 C = 

7 — 1 

(131b) 

J 

Qj > 0, for all j G {l,2,...,n}. 

(131c) 


where qj, j € {1,2, ...,n} are the decision variables, all of the parameters are nonnegative, and 
h\pi > h> 2 P 2 > ■ ■ ■ > h n fi n . Then, an optimal solution of the optimization problem (131) is 


qi = <71,2 a ihiw), 


Qj 


Qj,2 A 


Tj 



Qn — h’Ti 




72,2 

T2 



+ 


Qn- 1,2 
hn —1 


(132a) 

Vj € {2, 3,... , n — 1}, (132b) 

(132c) 
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Remark 13 Whenn = 2, the solution of Lemma 6 is the same as the solution of Lemma 1 by the fact 
that the notation (pi, p 2 , Qi, 2 , Q 2 , 2 , w) in Lemma 6 corresponds to the notation (ha, Pb, Q 3 , Q 6 , pa) 
in Lemma 1. 


Remark 14 In the optimal solution (132), we see that hj ( q,j — qjp) + = 0 for all j £ {1, 2,... , n — 1} 
but h n ( q n — q n p) + can be strictly positive depending on the parameters. This implies that the objective 
function (131a) can become positive only because of type n jobs. In other words, cost can occur only 
because of the cheapest job type. 


Lemma 6 implies that an optimal solution of the optimization problem (130) is 

Qi = Qi ,2 a (mi W), 


(133a) 


Qj — Qj, 2 a 


Qn — hn 


+n 


hj 


W - 


Q 1,2 

hi 


Q 2,2 

h2 


Qj- 1,2 


i Vj £ {2,3,..., n — 1}, 


W - 


Q 1,2 


Q 2,2 


Qn— 


1,2 


y y hi j h2 j hn- 1 

Then, we can interpret the following control policy from the solution (133): 
• The priority ranking of the job types is 1 > 2 > ... > n. 


Tj-i I 



(133b) 

+ 

(133c) 


• Type 1 jobs: If p± is less than or equal to the processing rate of the dedicated server for type 1 
jobs, server 1 should give static priority to type 1 jobs all the time. Otherwise, server 1 should 
give priority to buffer 1 only when the number of jobs in buffer 1,2 becomes less than the one 
in buffer 1. During the remaining time, server 1 should process the job types j £ {2,... , n}. 

• Type j jobs, j £ {2,..., n — 1}: Server 1 gives priority to type j jobs only when the higher 
priority job types ({1, ..., j — 1 }) do not require any processing. Consider the time intervals 
in which server 1 gives priority to type j jobs. If hj is less than or equal to the processing rate 
of the dedicated server for type j jobs, server 1 should give static priority to type j jobs all 
the time. Otherwise, server 1 should give priority to buffer j only when the number of jobs in 
buffer j, 2 becomes less than the one in buffer j. During the remaining time, server 1 should 
process the job types i £ {j + 1 ,... ,n}. 

• Type n jobs are processed during the remaining time. 


Therefore, we see a chained implementation of the proposed policy. The throughput rate of type 
j jobs is maximized by keeping buffer j less than or equal to buffer (J, 2) with minimum effort for 
all j £ {1,... ,n — 1} (cf. (133a), (133b), and Remark 14). By this way, server 4 gives as much as 
priority to type n jobs. 


Remark 15 It is possible to extend the network in Figure 8 to the case where each job type forks 
into arbitrary number of jobs (cf. Figure 1 ). Suppose that type j jobs fork into gj + 1 jobs and Qj, 2 ,i, 
z £ { 1 , 2 ,... ,gj} denote the buffers in front of the dedicated servers corresponding to the type j jobs. 
Then by the DCP (123) and the optimization problem (126) in Section 10.2, we need to replace Qjp 
with maxj g { li ...,gj}Qj, 2 ,i in both the DCP (129) and the optimization problem (130). We can still 
solve the modified version of the optimization problem (130) with Lemma 6 . The control policy that 
we can interpret is a modification of the control policy that we interpret from the solution (133) with 
the one from the solution (127). 


45 










10.3.2 Networks with Three Job Types and Two Shared Servers 

In this section, we consider the network presented in Figure 9, where there are three job types, 
namely type a, b, and c jobs. There are two shared servers, which are servers 5 and 6. We formulate 
and solve the approximating DCP in order to derive heuristic control policies. In contrast with the 
networks in Figures 7 and 8, the DCP solution is not a straightforward extension of the DCP solution 
presented in Section 4. 

Type a jobs (h a ) Type b jobs (h b ) Type c jobs (h c ) 


ill lLi ill 



i i i 


Figure 9: A fork-join processing network with three job types and two shared servers. 


Server 5 processes both type a and b jobs with rates ha an d Hbi, respectively; and server 6 
processes both type b and c jobs with rates HB2 and hc-, respectively. Let h a , hb, and h c denote the 
holding cost rate per job per unit time for type a, b, and c jobs, respectively. Suppose that both 
servers 5 and 6 are in heavy traffic, and servers 8, 9, and 10 are in light traffic. Let 


Abi 


W£-.= Q r 7 + ^Ql 
he 


denote the workload process (up to a constant scale factor) in servers 5 and 6, respectively. Then, 
parallel with Proposition 2, we can prove the following weak convergence result under any work- 
conserving control policy: 


(Q r i,Q r 2,Q r 3,Ql,Q r 9,w£,wz) => (Qi,Q 2 ,Q3,Q4,Q9,W 5 ,W' 6 ) , (134) 

where the limiting process is the zero process for the buffers whose corresponding dedicated server is 
in light traffic and an SRBM for the buffers whose corresponding dedicated server is in heavy traffic 
and the workload processes in servers 5 and 6. Since servers 8, 9, and 10 are in light traffic, we can 
assume that the service processes in these servers are instantaneous; and similar to (28), we have 


Q r w = (Ql - Ql) + , Q \ 5 = (Q r s - Ql) + , Ql + Qn = Ql + (Qr ~ Qe) + = Ql v Q r 7 . 
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Then, by using the technique that we use to derive DCP (30), we construct the following DCP for 
this network. For any x > 0 and t > 0, 

min P ^ 'h a ^Q 4 (t) + (QsW ~ Q^tjj ^ + h b ( Q 6 (t ) V Q 7 {t)j 

+ h c ^Q 9 (t) + ^Qs(t) ~ Qg(i)) 

s.t. Q 5 (t) + Q 6 (t ) = W 5 (t), (135) 

Mb i 

Qr(t) + —Qa(t) = W 6 (t), 

MC 

Qk(t ) > 0, for all k € {5,6,7, 8 }. 

When we consider the DCP (135) path-wise, we have the following optimization problem: 

min h a (Q 5 (u)t) - Q 4 (u t )j + h b (Qe(u t ) V Q 7 (wt)) + h c ^Q 8 (w t ) - Qg(w t )) > 

s.t. Q 5 (wt) + -t-^-Qeiut) = W 5 {ut), (136) 

MBI 

Qri^t) + ^-Qa{u t ) = TP6 (<^t), 

MC 

Qk{wt) > 0 , for all A; € {5,6,7, 8 }, 

where Qk{^t), k G {5, 6 , 7, 8 } are the decision variables. Note that, since servers 4 and 7 are indepen¬ 
dent of control, we neglect the term h a Q 4 (ut) + h c Qg(ut) in the objective function of the optimization 
problem (136). Next, we will solve the optimization problem (136) case by case. 

First, let us consider the case hbMB l > h a MA and hbMB 2 > h c Mc■ We present the optimal solution 
in the following result. 


Lemma 7 When hbMBi > h a MA and hbMB2 > h c Mc, an optimal solution of the optimization problem 
(136) is the following: 

1. If hb > h a MA/M bi + h c Mc/MB 2 , an optimal solution is 

($ 5 , Qe, Q 7 , Qs) = (w 5 , 0 , 0 , . (137) 

2. If hb < h a MA/M bi + h c Mc/MB 2 , an optimal solution is 


Q 5 = max 


Qa A W 5 , 


W 5 




(138a) 


Q 6 


Qy = min 




(138b) 


Qs 


= max i.Qg A ——Wq, 
{ MB2 




(138c) 
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The proof of Lemma 7 is provided in Appendix B. 8 . Note that the optimal solution has different 
structures depending on the cost parameters. On the one hand, hb > haHA/UBi + h c nc/l^B 2 intu¬ 
itively implies that the holding cost of a type b job is greater than the sum of the holding costs of a 
type a and type c job. Hence, {Qq, Q 7 ) = ( 0 , 0 ) in (137) and this implies that we should give static 
preemptive priority to type b jobs in servers 5 and 6 all the time. 

On the other hand, max{/t a /r^/ h c nc / HB 2 } < hb < h a ^A/HBi + h c nc /hB 2 intuitively implies 
that the holding cost of a type b job is greater than the one of a type a or type c job but less than 
the sum of the holding costs of a type a and type c job. In this case we interpret the solution (138) 
in the following way. If we decrease both Qq and Q 7 in (138b), which is equivalent to giving more 
priority to type b jobs, then jobs will accumulate in buffers 10 and 15 waiting for the corresponding 
jobs to arrive buffers 11 and 14, respectively. Hence, the throughput rate of type b jobs will increase 
but the throughput rate of both type a and c jobs will decrease, which is not desired because 
hb < h a iiAlH b\ + h c [ic / HB 2 - If we decrease only Qq in (138b), then the throughput rate of type a, 
b, and c jobs decrease, stays the same, and stays the same, respectively; and this is not desired. The 
case in which we decrease only Q 7 in (138b) follows similarly. Next, suppose that we increase Qe(t) 
or Qj(t) in (138b), which is equivalent to giving less priority type to b jobs. Suppose that Qe(t) = 
Qi{t) = (^Bi/HA)(W 5 (t) - Q 4 (t)) + in (138b) for some t G R+. Since Qio(t) = (Qs(t) - (? 4 (i)) + , 
by substitution Qio(t) = 0 in this case, i.e., throughput rate of type a jobs is in the maximum level. 
Then, when we increase Qe(t) or Qj{t ), the throughput rate of type a, b, and c jobs stays the same, 
decreases, and (in the best case) increases, respectively. Since h c fjLc/hB 2 < hb, this result is not 
desired. The case Qe(t) = Qi{t) = (W 6 (t) - {^32/hc)Qd{t)) + in (138b) follows similarly (for more 
intuition see the proof of Lemma 7 and Remark 16 in Appendix B. 8 ). 

Therefore, (138) is the optimal solution and we interpret the following policy from it: Whenever 
(138b) does not hold, if Qq (Q 7 ) is strictly greater than the right hand side of (138b), then server 5 
( 6 ) should give preemptive priority to type b jobs; if Qq (Q 7 ) is less than or equal to the right hand 
side of (138b), then server 5 ( 6 ) should give preemptive priority to type a (c) jobs. 

Second, let us consider the case h a /j,A > hb/J-Bi and hbiiB 2 > h c ^c- We can see that 

(Q5,Q6,Q7,QsJ = ^Qi/\W 5 , (w$ — Q 4 J , 



is an optimal solution of the optimization problem (136) for all t > 0 and u in H except a null set. 
Since the derivation of (139) is very similar to the proof of Lemma 7, we skip it. 

We can interpret a control policy from (139) in the following way. If server 4 processes type a 
jobs with a faster rate than server 5 does, then server 5 should give static priority to type a jobs. 
Otherwise, server 5 should use the SDP policy to pace the departure process of the type a jobs 
from buffer 5 with the one from buffer 4. Similarly, server 6 should use the SDP policy to pace the 
departure process of type b jobs from buffer 7 with the one from buffer 6 . 

Third, let us consider the case h a fiA > hb^Bi and h c [ic > hb^B2- We can see that 

(^Qbi Qgi Qh Q 8 ^ = A W 5 , Q 4 J t (wq — , ~J^Wq A Qg\ (140) 

is an optimal solution of the optimization problem (136). Since the derivation of (140) is very similar 
to the proof of Lemma 7, we skip it. 
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We can interpret a control policy from (140) in the following way. If server 4 (7) processes type 
a (c) jobs with a faster rate than server 5 (6) does, then server 5 (6) should give static priority to 
type a (c) jobs. Otherwise, server 5 (6) should use the SDP policy to pace the departure process of 
type a (c) jobs from buffer 5 (8) with the one from buffer 4 (9). 

Lastly, the case > h a /iA and hrfic > hb^B 2 is equivalent to the second case that we 

consider, hence we skip it. 

It is possible to construct DCPs for networks with more than three job types and more than two 
shared servers by the same methodology that we use to construct DCP (135). However, finding a 
closed-form optimal solution and interpreting a heuristic control policy is not trivial. As the number 
of shared servers increases in the network, the dimension of the corresponding DCP, which is the 
number of workload constraints, increases and finding a closed-form optimal solution becomes quite 
challenging. For example, there is only a single shared server in the networks considered in Figure 2 
and in Sections 10.1, 10.2, and 10.3.1. As a result, all of the corresponding DCPs (cf. (30), (112), 
(123), and (129)) are single dimensional, i.e., each DCP has a single workload constraint. However, 
when there are two shared servers as in the network in Figure 9, the corresponding DCP (135) has 
two workload constraints and is more difficult to solve than the single dimensional ones. 

In the queueing control literature, there are many studies (cf. Harrison [1998], Harrison and 
Lopez [1999], Bell and Williams [2001], Mandelbaum and Stolyar [2004], Ata and Kumar [2005], Dai 
and Lin [2008]) in which the corresponding DCP is first converted to a single dimensional equivalent 
DCP and control policies are obtained by solving the latter DCP. However, in general, how to control 
stochastic networks whose equivalent lower dimensional DCP has more than one dimension is an open 
problem. 

In order to overcome the curse of dimensionality in the networks with more than three job types 
and more than two shared servers, one possible solution is to decompose the network into the ones 
with at most three job types and use the control policies mentioned in this study to control each 
sub-network. For example, in Figure 10, there are seven different job types arriving to the system, 
and the network is divided into three sub-networks. Then the important questions are: 1) How can 
a large network be decomposed into smaller ones? and 2) Which control policies should be used 
in the servers which belong to two different sub-networks? If there exists a server which processes 
more than two different job types and is in light traffic, the network can clearly be decomposed from 
this server and any work-conserving control policy can be used in this server, because its processing 
capacity is high. Otherwise, the answers are not trivial and requires further research. 

11 Concluding Remarks 

The synchronization constraints in fork-join networks complicate their analysis. In this paper, 
we have formulated and solved approximating diffusion control problems (DCPs) for a variety of 
fork-join networks (depicted in Figures 1, 2, 7, 8, and 9). The DCP solutions have motivated either a 
slow departure pacing control or a static priority control, depending on the network parameters. We 
have rigorously proved the asymptotic optimality of our proposed control for the network in Figure 
2 , and we conjecture that the proofs will extend in a straightforward manner to the networks in 
Figures 1, 7, 8, and 9 and the network in Figure 2 with task dependent holding costs. 

Our proposed controls will not change when light traffic queues 6 are added to the fork-join 
networks that we have studied. Furthermore, minor modifications to our proposed policies will ac¬ 
commodate the addition of heavy traffic queues before the shared or dedicated servers. For example, 
if we add a heavy traffic queue between server 1 and buffer 3 in Figure 2, even though our proofs 
do not cover this case, we can see that the proposed policy (or a slightly modified version of it) is 

6 What we mean by a queue is a server and its corresponding buffer, e.g., server 3 and buffer 3 is a queue in Figure 

2 . 
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Figure 10: A fork-join processing network with seven job types and six shared servers. 


still asymptotically optimal. In this modified system, if server 3 is in light traffic, then the modified 
network is equivalent to the one where server 3 is replaced by the newly added heavy traffic queue 
and all the other servers have the same processing capacities with the ones in the original network. If 
server 3 is in heavy traffic in this modified system, then the proposed policy behaves in the following 
way: When the total number of jobs in buffer 3 and the buffer corresponding to the newly added 
queue is less than the one in buffer 4, then server 4 gives priority to buffer 4; otherwise server 4 gives 
priority to buffer 5. 

The complicated case is when there are heavy-traffic queues after the shared servers. Then, it 
is not clear either what the proposed policy should be or how to prove an asymptotic optimality 
result. An excellent topic for future research is to develop control policies for the broader class 
of fork-join networks with heterogeneous customer populations described in Nguyen [1994], More 
specifically, that paper assumes FCFS scheduling, but we believe other control policies can lead to 
better performance. 
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APPENDIX 

The appendix is organized into three sections: A, B, and C. Section A derives the convergence rate 
of the queue length process in a light traffic GI/GI /1 queue, which is used in the weak convergence 
proof in the main body (the proof of Proposition 5). Section B presents the proofs of all results 
in the paper with standard methodology. Section C provides the detailed results of all simulation 
experiments mentioned in Section 9. 
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A Proof of Proposition 5 

Without loss of generality, we will prove Proposition 5 only for r G N + . Let U( 0) := 0, P(0) := 0, 
k 

U(k ) := V7c G N+, and A{t) := maxjfc 6 N : !/(/;) < t}, Vf > 0, 

i= 1 
k 

V{k) := Vi , V7e € N+, and 5(f) := max{fc € N : V(k) < t}, Vf > 0. 

2=1 

Let T(f) ( I(t )) denote the cumulative amount of time the server is busy (idle) up to time t, t > 0. 
Then, T(t ) + /(f) = t for all t > 0 and 

Q(t) = Q(0) + A(t)-S(T(t)), Vf> 0. 

Let us define the following shifted processes. For all t > 0 and i G N, 

I r '\t) := I{ir +1) - I(ir), T r ’\t) := T{ir +1) - T(ir), 

A r,i (t) := A{ir + t) - A(ir) - At, S r '\t) := S(T(ir + t)) - S(T(ir)) - pT r ’\t), 

Q r,i (t) := Q(ir + t), X r,i (t) := Q(ir) + A r ’\t ) - S r ’\t) + (A - //)t, 


Then, after some algebra, for all t > 0 and i G N, we have 

Q r,i (t) = X r,i (t) + pl r,i (t). 

For all i 6 N and r G N + , since I r,l (0) = 0, is nondecreasing, and / 0 °° Q r, *(t)d/ r, *(t) = 0, we 

have 

(Q r ’ i ,Hl r ’ i ) = ($,T)(A^). 

Next, for all r G N+, let us define the following sets: 


<? r = Q(0) < 


((M — A) A e)r 


n n Ill’ll, vll' 57 ' 

' i =0 ^ 


< 


((/j- A) A e)r 
8 


v- .n"‘ {«*•)< M^LL 

i =0 ' 


(141) 

(142) 


We have the following result related to (141) whose proof is presented in Appendix A.l. 


Lemma 8 For all e > 0, there exist r 2 > 0 and strictly positive constants C 4 , C 5 , C$, and Cj which 
are independent of r such that if r > r 2 , 

P (||A r,i || r > er) < C 4 r n e- Csr , (143) 

P(ll 5r,< llr> er ) <C 6 r n e~ c ' r , (144) 

for all i G {0,1,..., r n_1 — 1}. 

Next, we will show that Q r C W by induction. First note in the set Q r , (Q(0) < ((p — A) A e)r/2}. 
Next, suppose that, in the set Q r , {Q(jr) < ((p — A) A e)r/2} for all j G {1,2,..., i} for some i G 
{0, 1,... , r n_1 — 2}. Then, in the set Q r , 

Q((i + 1)0 = Q r '\r) = X r,i (r) + pF\r), 
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= Q{ir) + A r,i (r) - ST’^r) + (A - »)r + sup (~Q(ir) - A r '\t) + S r ’\t) + (ji - A )t) + , 

0 <t<r 

<Q(ir) + ——^ + (A — n)r + sup (-A r,l (t) + S r,l (t)) + + sup (—Q(ir) + (/r - X)t) + , 


o <t<r 


0 <t<r 


< —— ^ A e ^ r + Q(ir) + (A - /i)r + sup (- Q(ir ) + (// - A)t)" 


0<t<r 


((^ ~ A) A e)r 
2 

((M ~ A) A e)r 
2 

((M-A) Ae)r 


+ Q(ir) + (A - fj)r + (-Q(ir) + (// - A)r) V 0, 
+ Q(ir) + (A - /j,)r - Q{ir) + {n~ A)r, 


where the first and second inequalities are by (141) and second to last equality is by the induction 
hypothesis. Therefore, Q r C T~L r . 

Next, note that the probability in left hand side of (59) is equal to 


r r n-i_i 

U 

i =0 


sup Q r ' l (t) >re\) < P 

0 <t<r 


fr n ~ 1 -1 

f "j 

U ] 

sup Q r - l (t) > re 

\ i=0 

(0<t<r J 

( r n 1 — 1 

f ) 

u j 

sup Q r - l (t) > re 

\ i= 0 

lo <t<r J 


First, let us consider the first probability in (145). Let us fix an arbitrary r £ N+. In the set Q r <l'H r , 
for all i £ {0,1,... , r n ~ l — 1}, we have 

sup Q r \t)= sup {X r '\t) + ^(t)} , 

0<t<r 0<t<r 

= sup i.Q(ir) + A r,l (t) - S r ’ l (t) + (\-n)t+ sup (-Q(ir) - A r,l (s) + S r,l (s) + (/i - A)s) + 1 , 
0 <t<r { 0 <s<t J 

< —— )) A e ^ r _|_ SU p f Q(ir) + (A — jj,)t-h sup (— Q(ir) + (// - A)s)" 

4 0 <t<r l 0 <s<t 

((fi - A) A e)r 


((/i - A) A e)r 


+ sup (Q(ir) + (A - n)e) (t), 

o <t<r 

+ Q(ir), 


< ((// - A) A e)r, 

where the hrst and last inequalities are by (141) and (142), respectively. Therefore, 

y.n — 1 _ 1 


U ( sup Q r \t ) > re) D Q r D U r = 0, 
(To ^<t<r J 


(146) 


( 147 ) 


by (146), which implies that the first probability in (145) is equal to 0. 

Second, let us consider the second probability in (145), which is less than or equal to 

((/i - A) A e)r 


P Q(0) > 
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‘-1 


+ E p IK 


> 


((/i-A) Ae)r 
8 


L -i 


+ E p |ML> 


((/i-A) Ae)r 
8 


(148) 


i=0 x 7 z=0 

by (141). Let rq := ro V r 2 - Then, by (58) and Lemma 8 , if r > rq, (148) is less than or equal to 

Cor 2 n " 1 e“ Cir + C' 4 r 2 n - 1 e“ C5r + C 6 r 2 n - 1 e~ C7r = C 2 r 2n ~ l e" C3r , (149) 

where C 2 := Co + C 4 + Cg and C 3 := min{Ci, C 5 , C 7 }. Therefore, (145), (147), and (149) prove 
Proposition 5. 

A.l Proof of Lemma 8 

We first present some preliminary results in Appendix A.1.1, second prove (144) in Appendix 
A.1.2, and lastly prove (143) in Appendix A.1.3. Let us fix an arbitrary i £ {0,1,... ,r n_1 — 1}. 
Without loss of generality, we choose e > 0 such that e < p. 

A. 1.1 Preliminary Results 

Note that 

||S TV || r = sup |<S' r,l (t)| = sup \S(T(ir + t)) — S(T(ir)) — pT(ir + t) + pT(ir )\, 


0 <t<r 


0 <t<r 


< sup \S(T(ir) +t) — S(T(ir)) — fi(T(ir) +t) + fiT(ir)\ 

0<t<r 

= sup \S(T(ir)+t) — S(T(ir)) — fit\ , 

0 <t<r 


(150) 


where the hrst inequality is by the fact that for all t > 0 , there exists an s £ [ 0 , t] such that 
T(ir + 1) = T(ir ) + s, because T(-) is nondecreasing and T(t ) < t for all t > 0. For all m £ N, let 

•Fm := <? {uk, k £ N_|_; v k ,k £ {1,2,... ,m + 1}} . 

Clearly, {T m ,m £ N} is a filtration. Then, similar to Lemma 8.3 of Williams [1998a] and Lemmas 
7.5 and 7.6 of Bell and Williams [2001], we have the following result. 


Lemma 9 For all t > 0, S(T(t)) is a stopping time with respect to the filtration {Tm^m £ N}. The 
a-algebra associated with the stopping time S(T(t )) is 

Fs(T(t)) ■= {71 £ T : A n { S(T(t )) = mj € F m , Vm £ N} . 

Then, vs(T(t))+i £ Fs(r(t))- kjet {Ak,k £ N + } be a sequence of sets such that Ak £ j8 (M) for all 
k £ N+, and B be a set such that B £ Fs(r(t))- Then, 

P (' v s(T(t))+k € A/., k € { 2 , 3,...}, B, S(T(t)) < 00) 

= P {v k - 1 £ A k , k £ { 2 , 3, .. .}) x P (B , S(T(t)) < 00) 

Lemma 9 can be proven similarly to Lemma 8.3 of Williams [1998a] and Lemma 7.6 of Bell and 
Williams [2001] are proven, hence we skip the proof. 

Next, we present some preliminary results related to large deviations theory. For all a > 0, let 


1{ol) := InE 


exp 




(151) 
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Then, e^ a ' < oo for all a € (—a, a) by the exponential moment assumption on v\. For x > 0, let 

Ai(x) := sup {ax — £{a)} , and A. 2 (x) := sup {ax — £(—a)} . (152) 

a>0 a>0 

Note that Ai and A 2 are not exactly but very similar to Fenchel-Legendre transform of £(a) and 
£(—a), respectively (cf. Definition 2.2.2 of Dembo and Zeitouni [1998]). Then, we have the following 
result. 


Lemma 10 Both A\ and A 2 are convex and nondecreasing in [0, 00), Ai(0) = A2(0) = 0, and 
Ai(x) > 0 and A 2 (x) > 0 for all x > 0. 

Pf: First, let us consider Ai. Let 

A* (a;) := sup {ax — £{a){ , Mx € R, (153) 

aSK 

denote the Fenchel-Legendre transform of £{a) (cf. Definition 2.2.2 of Dembo and Zeitouni [1998]). 
Then, by Parts (a) and (b) of Lemma 2.2.5 of Dembo and Zeitouni [1998], A][ is convex, A£(0) = 0, 
A\ is nondecreasing in [0, 00), and for all x > 0, 

A£(:c) = sup {ax — £(a)} = Ai(x) V 0, (154) 

a>0 

where the last equality in (154) is by (152). Moreover, by Parts (a) and (c) of Lemma 2.2.5 of Dembo 
and Zeitouni [1998], £ is convex in R, £ is differentiable in (—a, a), and £(0) = £'{0) = 0, where £' 
is the derivative of £. Then, £ achieves the global minimum at point 0; and since it is convex, £ is 
nondecreasing in the interval [0, 00). Therefore, for any fixed x > 0, there exists an a* £ (0,cf) such 
that Ai(x) > a*x — £(a*) >0. As a result, Ai(x) = A*(x) > 0 for all x > 0 by (154). Lastly, since £ 
is convex and achieves its global minimum at point 0 in its domain R and continuous in the interval 
( ^)? 

Ai(0) = - inf £{a) = ~£(0) = 0 = AJ(0), 

Q>0 

where the first equality is by (152). Hence, Ai(x) = A{(x) for all x > 0, which implies that Ai is 
convex and nondecreasing in [0, 00), Ai(0) = 0, and Ai(x) > 0 for all x > 0. 

The proofs for A 2 follows with exactly the same way. The only difference is that we consider the 
random variable v\ — l//a instead of l//r — v±. m 

Next, we will derive three more preliminary results. 

Lemma 11 For all e > 0, there exists an r^ > 0 such that if r > r%, we have 

P (S(r) > (/x + e)r) < , 

where Cs and Cg are strictly positive constants independent of r. 

Pf: 


P(S’(r) > (n + e)r) < P(5(r) > [{^ + e)r\) = P (F(U> + e)rj) < r ), 



( 155 ) 
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(156) 

| L(m + £)r\l(a) - a er ^ 1 1 , (157) 

= ex P {^L(« + (n ^ l ] )r j - <W) } , (158) 

where a > 0 is an arbitrary constant in (155), (156) is by Markov’s inequality, and the equalities in 
(157) are by the i.i.d. property of the sequence {vj,j £ N + } and (151). Since (158) is valid for all 
a > 0 , 


< E 


L(/i+e)rJ 


exp < a ^2 (- 


3 = 1 


— V 


L(M+«)rJ 

n e 




exp < cm -u 

ti¬ 



er — 1 


exp < —a 


ti¬ 


er — 1 

exp < —a-> = exp 

M 


P (5(r) > (ju + e)r) < exp j - U> + e)rj sup ~ } ’ 

= exp{-L((i + c)rJAi ’ ( 159 > 

by (152). Note that there exists an r% > 0 and e\ > 0 such that if r > r^, we have 

er — 1 

//[(// +e)rj > 61 ' 

By Lemma 10, Ai(ei) > 0 and Ai(x) is nondecreasing for all x > 0, thus the term in (159) converges 
to 0 with exponential rate as r —>• oo. To complete the proof, let Cs '■= exp(Ai(ei)) and Cg := 
(n + e)A 1 (ei). m 


Lemma 12 For all t £ M +; P ( S(t) = oo) = 0. 


Pf: Fix arbitrary t £ R + and 0 < e < l//j. Then, 


P (S(t) 


oo) = P (V(k) <t, Vk£ N+) = P 



< T , Vk £ N h 
k 


= 0 , 


because for each u £ Q except a null set, there exists a ko £ N + such that if k > kg, V(k,uj)/k > 
lf/i — eby SLLN. When k is sufficiently large, V(k, u)/k > l//r — e > t/k for almost all uj £ J), which 
completes the proof. ■ 


Lemma 13 For all e > 0 and r > 0, we have 


P|h-> erj < C 10 e- Clir , 
where C\g and Cn are strictly positive constants independent of r. 


1 


Pf: 

p („ i _i > er ) = p (|(„ i _l) > | £r ) =p ( exp {|(„ i ^l 


> 


r aer ~\ 

exp |—! 
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< E 


exp 



exp 





aer 

~2~ J ’ 


(160) 


where the inequality in (160) is by Markov’s inequality, and the equality in (160) is by (151). Since 
l {—a/2) < oo because of the exponential moment assumption, (160) gives us the desired result. To 
complete the proof, let C\q := exp{f(—d/2)} and Cu : = de/2. ■ 

Next, we will prove Lemma 8 by Lemmas 9, 10, 11, 12, and 13 in the following section. 

A.1.2 Proof of (144) 

Let 

rf := inf { t £ [0, r] : \S{T(ir) + t) — S(T(ir )) — fit\ > er} , 
where inf { 0 } := oo for completeness. Then, 


P (||S r ’ i || > er) < P ( sup | S{T{ir) + t)~ S{T{ir )) - fit] > er) = P {rf < r ), 
r VO <t<r J 

by (150). Let 

S(T(ir))+k S(T(ir))+k 

v i ’ r {k):= Vfee{2,3,...}, vf r {k):= £ Vj , Vk 6 N + , 

j=S(T(ir ))+2 j=S(T(ir ))+1 


(161) 


(162) 


V^ r {l) : = 0 and V i ’ r {k) = Vf r {k) := 0 for all k £ {..., -2, -1,0}. Then, for all t > 0, 


(|5(T(ir) + t) - S{T{ir )) - /rf| > er} 

= {S{T{ir) + t) — S{T{ir )) > [it + er} U { S{T{ir) +1) — S{T{ir )) < fit — er} 

C (S(T(zr) + 1) - S{T{ir )) > [_ fit + erj} U { S{T{ir) + i) - S{T{ir )) < fit - er]} 

C jV*’ r (|_ jit + erj) < f j U ^yf r {\nt - er]) > tj . 

Let us dehne 

rf := inf j 1 £ [0, r] : V l,r {[/j,t + erj) < f| , 

V 2 ■= inf [t € [0, r] : Vf r {fit - er]) > t j . 

Then, rf > rf Arf, so 

P {rf < r) < P {rj[ A rf < r) < P {rf < r) + P {rf < r ). (163) 


First, 


P {rf < r) 


P 


(oil (r i,r (Lf‘ t + erJ ) 



< P 

= P 

< P 


mm 
j = Ler|,...,LO+e)rj 


mm 

j=LerJ,...,L(Ai+e)rJ 


nun 
j=LerJ,...,L(Ai+e)rJ 


v-'U) - ^ < 2 ~ er 


k 


k 


V^ (j) - J - -< -——, S{T{ir )) < oo ) + P {S{T{ir)) = oo) 


k 


k 
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= P ( min (v l ' r (j) — - -< -——, S(T(ir )) < oo 

\i = L er J L(M+ e ) r J V A* / V- 

= P ( min (V (j — 1) — --< -——, S(T(ir )) < oo 

= p( min (V O') - ^ 

V=L er J-i>-»»L(/i+e)r|-i V M/ M 

= P f- f min (V(j) ~^]) > 

V \i=M- 1 v,L(/ i + e ) r J- 1 V /V / A ( 

= p( max f ^ - y (j)l > 

V=L«\|-l,... ) L(/i+e)r|-l VA 4 / A 1 

= P f max a ( — — y (j) ) > a—-- 

V=L er J- 1 >"-.L(# 1 +e)rJ - 1 VA 4 / A 1 

= P ( exp < max a ( — — y(?) ) 1 > exp < a- 

V b=M— i,...,L(M+ £ )rj-i Va* /J l At 

= P ( max exp < a ( — — y( 7 ) | > > exp < a - 

Vj=L«-J-1 L(a*+€)t-j- 1 i Va^ 7 J l A 4 


< E 


= E 


exp 


{“( 


L(/t + e ) r J - 1 


A 1 


-y([(^ + e)rj -1) 


er — 2 


exp < —a 


A* 


L(/i+e)rJ-l 


exp ^ a |-u, 


j=i 


A* 


er — 2 


exp < —ct 


= E 


L(/i+e)rJ-l 

n 

j=l 
L7+e)rJ-l 


exp < a- Vj 

1 


er — 2 


exp < —a 


= E 


II E exp {“ Q - 7 ) } ex p| 

My" 1 )} 


L7+e)rJ-l 


exp < —a 


er — 2 

*- 

At 

er — 2 


A 1 


= (e««)) l( '‘ + ‘ ,rJ - 1 ex p (- a 2!iy 


l L 


= ex P -(L(A* + O r J - 1) a 


er — 2 


A l (L(A t + e )rJ - 1) 


— ■£(«) 


(164) 

(165) 


(166) 


(167) 


(168) 

(169) 

(170) 

(171) 


where (164) is by the fact that T(t) < t for all t > 0 and Lemma 12; (165) is by Lemma 9 and (162), 
a > 0 is an arbitrary constant in (166), we use Doob’s inequality (cf. Theorem 5.4.2 of Durrett 
[2010]) in order to obtain the inequality in (167), (168) and (169) are by the i.i.d. property of the 
sequence {vj,j G N + }, (170) is by (151). Since (171) is valid for all a > 0, by (152) we have 


P (r/i < r) < exp — ( [(a* + e ) r \ ~ !) A i 


er — 2 


/e(L(/r + e)rJ - 1) 


(172) 


Second, 

P (772 < r) 
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= P ( sup (vl’ r (I ~nt 

\ 0 <t<r ' 


erl) -t) 



< P 

= P 

= P 

< 2P 


max ( Vl' r (j) — - - ^ — Cl ) > 0 






(173) 


er-i 

max V, (?)-> - 

r/ -'- 111 /V » 




v S(T(ir ))+1 


1 


V 


«S(T(ir))+l - - > 
r 


1 er — 1 


VS(T(ir))+l - + i=2 1 “ 


j -1 


2 /z 


+ p 


max ( V l,r (j) — - -- ) > 


j= 2 ,...,r(/i-e)rl 




er — 1 
2 /? 


> 


er — 1 




(174) 

(175) 


where (173) is by the fact that e < //, (174) is by (162). By the same way we derive (172), we can 
derive that the second probability in (175) is less than or equal to 

exp {—([(" - e)r l - 1 > A * —!) )}■ (176) 

Next, let us consider the first probability in (175), which is less than or equal to 

2P (v S (T(fr))+i ~~ > (T ' 2 ji 1 ’ S<yrU ~ r ^>- 2/i ^ n “ r ) ~ X ) + 2P ( S '( r ' n ~ r )> 2 A*(r n - t-) - i) 

< 2P max ^ + 2P (S(r n — r) > 2/r(r n — r) — 1) 

V'e{l|2,.",2/i(r”-r)} \ A 1 / 2 /i / 

< 4//(r Tl - r)P ^ + 2P ( S ( r?l - 0 > 2 /i(r n “ r) - 1). (177) 

By Lemmas 11 and 13, there exists an > 0 and strictly positive constants C 12 and C 13 independent 
of r and i such that if r > rq, the sum in (177) is less than or equal to Ci 2 (r n — r)e~ Cl3r . Next, let 
us consider (172) and (176). There exists an n 5 > 0 and 62 > 0 such that if r > r 5 , we have 

er — 2 er — 1 

ML(// + 6 )rj-l) >€2 ’ and 2 / ?(K / r-e)rl-l) >62 ' 

By Lemma 10, Aj(e 2 ) > 0 and Aj(x) is nondecreasing for all x > 0 and i £ {1,2}. Hence, if 
r > r 5 , the sum of the terms in (172) and (176) is less than or equal to Cus~ Clsr , where C \4 := 
2exp{Ai(e 2 ) VA 2 (e 2 )} and C 15 := (^ - e)(Ai(e 2 ) A A 2 (e 2 )). Lastly, let r 2 := r 4 Vr 5 , C 6 := C 12 VC 14 , 
and C 7 := C 13 A C 15 , then we obtain (144). 

A.1.3 Proof of (143) 

Note that proving (143) is equivalent to proving that there exist r 2 > 0 and strictly positive constants 
C 4 and C 5 which are independent of r such that if r > r 2 , 


P 


( sup | S(ir + t) — S(ir) — fit] > er 
Vo <t<r 


< C 4 r n e- C5r . 


(178) 


Hence we will explain how to prove (178). Let for all m £ N 

T m '■= cr { v k , k £ { 1 , 2 ,... , m + 1 }} . 

Then, it is easy to see that S(ir) is a stopping time with respect to the filtration {J^m £ N}. The 
rest of the proof is very similar to the one of (144) (cf. Appendix A. 1.2), the major differences are 
that 
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1. the definitions in (162) should be updated as 

S(ir)+k S(ir)+k 

V l ’ r (k):= J2 A’ Vfc € {2,3,...}, V*' r (k) := £ v i’ Vfc € N + , 

j=S(ir )+2 j'=S(ir)+l 

2. we need to use Theorem 4.1.3 of Durrett [2010] instead of Lemma 9 in order to obtain (165). 

B Proofs of the Results Stated in the Main Body 

In this section, we present the proofs of some of the results stated in the main body of the paper. 

B.l Proof of Proposition 1 

We will present the proof for the case in which servers 1 and 2 are in light traffic and servers 3 
and 5 are in heavy traffic. The other cases follow similarly. We will derive the fluid limit results first 
for server 1, second for servers 3 and 4, lastly for server 6. Results associated with the other servers 
follow similarly. By FSLLN, we have as r —>• oo 

(Sj, j €JUA)^ (Sj, j GjUi) u.o.c., (179) 

where for all t > 0, S a (t) = X a t, Sb(t) = Xbt, and Sj(t) = /ijt for all j G A. After some algebra, we 
have for all t > 0, 

o r M=T 1 (t) 

T(i) := (K(t) - a :t) - - AT !(()) + W - rf)t. 

Since server 1 works in a work-conserving fashion (cf. (5b)) and by the uniqueness of the solution 
of the Skorokhod problem (cf. Proposition B.l of Bell and Williams [2001]) with respect to A l . we 
have = ^(A^) and Q\ = 3>(Ai) for all r. By (179), sup 0<t<T |5„(t) — X r a t\ 0 for all T > 0. 
Since T\(t) < t by definition, we have for all T > 0 

sup \S\(T[(t)) — < sup |)S^(t) — Hit\ ^A 0, as r —> oo. (180) 

o <t<T o <t<T 

Therefore, X\ ^A X\ u.o.c., where X\(t) = (X a — fj,i)t. Since the mappings T and $ are continuous, 
we have n\I l ^A 'F(Ai) =: fi\Ii u.o.c. and Q 1 Q L = <J>(Ai) u.o.c. Since A a < n i, then 
1 1 (t) = (1— X a /ni)t for all t > 0 and Q 3 = 0. Then by (2a), T\ ^A T\ u.o.c. where T \ (t) = (\ a /fii)t. 
Next, let us consider server 3. After some algebra, we have for each t > 0 

Q 3 (t) = X 3 (t) + M 3-^3 (*) 

T 3 (t) := (Sj(Tl(t)) - AT[(t)) - {Sl(Tl(t)) - A 3 Tl(t)) + AiT[(t) - 

By the uniqueness of the solution of the Skorokhod problem with respect to A 3 , we have 1 AX 2 , = 
4 , (a 0) and Q 3 = 4>(A^) for all r. By (180), we have (S^(T^) — fAT'i) —A 0 u.o.c. Since T 3 (t) < t 
for all t > 0, we have (S 3 (T 3 ) — —A 0 u.o.c. by (179). Since A a = /i 3 and T\ (t) = (X a /n{)t 

for all t > 0 , we have A 3 A 3 = 0 u.o.c. Then, we have n 3 I 3 —A Vl^Ag) = 0 u.o.c. and 
Q 3 -—>■ Q 3 = 4 >(A 3 ) = 0 u.o.c. Then by (2a), T 3 ^A T 3 u.o.c. where T 3 (t) = t = (X a /fj, 3 )t for all 
t > 0 . 

Let us consider the workload process in server 4. For all t > 0, 

Wl(t) = Qlit) + = %{t) + AaX i(t), 
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u\{t) : = - ^ A (r A (jt)) - s A r A ®) + & (^(r r 2 (t)) - (. s r B (r r B (t )) - fi r B r B (t ))) - ^ 

Since we consider work-conserving policies, server 4 can be idle only if there is no workload in bufers 
4 and 5. Therefore, we can use the uniqueness of the solution of the Skorokhod problem with respect 
to Tf 4 and we have ii r A I\ = ^{U\) and W\ = 4*(C/ 4 ) for all r. By (179) and the logic of (180), we have 
(S) \(T r A ) — /i^T^) 0 u.o.c. and ( ’S r B (T B ) — figT^) 0 u.o.c. By the fluid limit results of 

servers 1 and 2 and Theorem 5.3 of Chen and Yao [2001] (Random Time-Change theorem), we have 
S’' ] (T\) 71/4 u.o.c. and S^(T^) -^4 52 (T 2 ) u.o.c. Therefore, C /4 -^4 C /4 u.o.c. where C /4 = 0 

by Assumption 3 part 3. Then, by the continuity of the mappings T and 4>, we have fi r A I 4 0 

u.o.c. and W\ 0 u.o.c. Since 0 < < W\ and 0 < Q 3 < {n r B //J r A ) W 4 ■ we have Q\ 0 

u.o.c. and Q 5 -^4 0 u.o.c. Note that, 

m) = S r i(T r i(t)) - FaFa®) - tiF A (t)) - » r A T r A {t), (181) 

Ql(t) = S r 2 (T r 2 (t )) - (S r B (T r B (t)) - S B T B {t)) - S B T B (t). (182) 

By substituting the above results in (181) and (182), we have T\ -^4 T A u.o.c. where T A (t ) = 
(X a /n A )t and -^4 T B u.o.c. where Ts(t) = (\b/HB)t for all t > 0 . 

Next, we consider server 6 . Let Z e := Q 7 A Q§. After some algebra, for each T > 0 

^ 6 ^) = X 6 (t) + ^ 6 -^ 6 ^)) 

% := S r 3 (T r 3 (t)) AS r A (T r A (t)) - (^ 6 (r 6 (f)) - ^T£(t)) - // 6 C. 

Since server 6 can be idle only if Z^(f) =0 (cf. (5d)), we can use the solution of the Skorokhod 
problem with respect to Y 6 and we have /igI 6 = 'L(Yg) and Z 6 = 4>(Y 6 ). By (179) and the logic 
of (180), we have (Sg(Tg) — XgTg) -^4 0 u.o.c. By the fluid limit results of servers 3 and 4 and 
random time-change theorem, we have ~S^(T 3 ) -^4 Ss(T 3 ) u.o.c. and S r A (T r A ) -^4 S A (T A ) u.o.c. 
Therefore, Y 6 -^4 Yg u.o.c. where Yg(f) = (A a — /ig)t for all t > 0. Then, by the fact that A a < //g 
and the continuity of the mappings 'L and 4>, we have I 6 -^4 I g u.o.c. where /g(f) = (1 — \ a /^)t 
and Zg ^^4 0 u.o.c. By (2a), Tg -^4 Tg u.o.c. where Tg(f) = (X a /fJ-e)t. Note that, 

m) -sstoo), qsw=^(^( 0 ) -ss®)). (i83) 

Therefore, we have Q 7 -^4 0 u.o.c. and Qg —4 0 u.o.c. by (183) and random time-change theorem. 

The convergence results related to Q k , k £ {2,6,9,10} and T j: j £ {2,5,7} follow similarly, 
hence we skip them. 

B.2 Proof of Proposition 2 

We provide the proof for the case V. = {2, 3, 5}, i.e servers 2, 3, and 5 are in heavy traffic together 
with server 4 but server 1 is in light traffic. The proofs of the other cases follow similarly. We derive 
the weak convergence result first for server 1, second for servers 2, 3, 4, and 5. 

We use the Skorokhod’s representation theorem to obtain the equivalent distributional represen¬ 
tations of the processes in (39) (for which we use the same symbols and call “Skorokhod represented 
versions”) such that all Skorokhod represented versions of the processes are defined in the same 
probability space and the weak convergence in (39) is replaced by almost sure convergence. Then we 
have (40) and let us consider the Skorokhod represented versions of the processes in (40). 

We first consider server 1. By (40), random time-change theorem, and Theorem 4.1 of Whitt 
[1980] (continuity of addition), we have S'” — S{ (T\ ) -^4 S a — Si(Ti) u.o.c. Since server 1 works in 
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a work-conserving fashion and is in light traffic, we have Q\ = 4>(X[) and fi \/[ = T (X[) by (19a) 
and (22). Then by Lemma 6.4 (ii) of Chen and Yao [2001], we have 


(mW + r(X r a - lA)e, Q[) ^ (~S a + Si(Ti), o) , n.o.c. (184) 

Next let us consider servers 2, 3, 4, and 5. Let Q'y, Xy, and Iy be |?7|-dimensional vectors 
derived from the vectors Q r , x r , and I r (cf. ( 20 )) by deleting the rows corresponding to each i, 
i ^ R, respectively. Let Ry denote the \H\ x \H \-dimensional matrix derived from R (cf. (21)) by 
deleting the rows and columns corresponding to each i, i ^ R. Then by (20), (21), and (22), we have 

r> r — T7 r 

'*h — + ±t H 1 n- 


By the fact that all servers work in a work-conserving fashion (cf. (5)) and Theorem 7.2 of Chen 
and Yao [2001], Qy = <j>(Xy) and Iy = ip(Xy), where ((f>,ip) is the multidimensional reflection 
mapping which is Lipschitz continuous under the uniform norm. Hence let us first focus on Xy. By 
Assumption 3 Parts 3, 4, and 5, (17), (18), (19), (40), random time-change theorem, continuity of 
addition, and the fact that Q\ —A 0 u.o.c., 


V r 


*5 


XZ 

a.s. 

XI 

— > 

vr i a vr 

L A 4 + fj, r B A 5 J 



S b -Sj(T 2 )+d 2 e 
_S a -S 3 (T 3 )+e 3 e 
S 2 (T 2 ) - S 5 (T 5 ) + (6 5 -0 2 )e 
S a - S A (T A ) + la (s 2 (T 2 ) - S B (T B )) + 


04 - £ 0 2 


= : X 


n 


u.o.c. 


(185) 


After some algebra, it is possible to see that Xy is a Brownian motion starting from 0|%| with 
drift vector 6y and covariance matrix T,y. By the continuity of the multidimensional reflection 
mapping, we have Qu = <fi(Xy) and In = if)(Xy) , where 

Qn = (Q 2 , Q 3 , < 36 ,^ 4 ) , l n = (h,h,h,h) ■ 

By Definition 3.1 of Williams [1998b], is an SRBM associated with the data (P\y\, 9y,Y,y, Ry,0\y\). 

Since the Skorokhod represented version of the processes have the same joint distribution with 
the original ones, when the Skorokhod represented versions of the processes converge almost surely 
u.o.c., then the original processes weakly converge. In other words, corresponding to the original 
processes, we have 

(Ql,QlAQl,Wi) => ( 0,Q 2 ,Q 3 ,Qe,W 4 ) . (186) 

B.3 Proof of Lemma 2 


+ 1) = sup Ik € N : •£ v r A (t) < + 4 

( A- n k 

= sup |fc+^:fc£N, ^2 v a ( 0 + ^2 — Ta( t 2n-l) + * 

l i=1 1=1 

> sup | k + A r n : k <G N, T^LL-i) + ^2 + ^n) — Ta{ t 2n—l) + ^ 


1=1 


(187) 
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= ^ + sup | A: € N : £ v\{l + <t\=A r n + E r /(t), (188) 

where (187) is by the fact that Ylt =1 v aX < ^A( r 2 ra-i) by definition of A r n . Similar to the derivation 
of (188), we can get the following result: 


+ t) = b; + sup Ik e N : £ v' 3 (l + B' n ) < (| = B' n + ££■"(<). (189) 

Note that, we have equality sign in (189) unlike the greater than or equal to sign in (187). This 
is because there is a service completion in server 3 exactly at by construction (cf. (54a)). 

Therefore, we have Yh=i v 3 ^) = ^~ 3 ( r 2 n-i); an d this gives us the equality sign in (189). Then, for 
all t > 0, 

'S 3 0^3 ( r 2n-i) + t) - S r A (T r A (r r 2n -i) +t)< E$ n (t) - EX it) + 1, 
by (188), (189), and the fact that B — A r n = 1. 

B.4 Proof of Lemma 3 

First, we present the following results from the literature which will be used later in the proof. 


Lemma 14 (Lemma 9 of Ata and Kumar [2005]) Given e > 0 and T > 2/e, we have for each 
j € J U A and a > 0 

P (0S<t “ XjA - eT ) - 

where 


Cj(e,a ) 


/2 + 2 a \ 2+2a / 18(2 + 2 ct ) 3 / 2 \ 2+2a 

Vl + 2 a) ^ (1 + 2a) 1 / 2 J 


x E [|fj(l) — l/xj| 2+2a ] 


,2/ . 1+a 


4 x)(xj + e) 


+ 



(190) 


if j € J, then Xj = Xj.; and if j € A, then Xj = pj. 


The proof of Lemma 14 can be seen in Ata and Kumar [2005] (Note that we have fixed a small typo 
in (190) by replacing ((2 + 2ct)/(l + 2a)) with ((2 + 2a)/(l + 2a))" + ' Q . This typo does not affect the 
results of Ata and Kumar [2005]). Since we assume exponential moment condition for the service 
times (cf. Assumption 2), Lemma 14 holds for all a > 0. However, the proof presented in Ata and 
Kumar [2005] requires a weaker moment assumption. 


Lemma 15 ( Equation 3.67 of Hall and Heyde [1980])For any martingale with differences 
Zi, 1 < i < n and any p >2, we have 


E 




PI 


< (18pq 1 / 2 ) p n p / 2 max E [| Zf v ] , 


where q = (1 — p x ) 1 
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The proof of Lemma 3 is a modification of the proof of Lemma 14. We present the proof for the 
case j = 3, the other case follows similarly. Let us define 

£n,3--={B r „<nS 3 +e9)r 2 T]}, (191) 

where eg > 0 is an arbitrary constant. Then, 

N r / 

E P sup \E 3 ,n (t) - (J%t\ > e 2 r 7 T, r£ n _i < r 2 T 

\0<t<r^T 

N r N r 

<^P(4_i<r 2 T, (^ >3 ) C ) + ^P 

n =1 n=l 

Let us consider the first sum in (192). 


sup 

\0<t<r'rT 


\E r 3 ’ n (t)-^t\>e 2 v^T, £, 


n, 3 


(192) 


N r 


N r 


E P K-i < r 2 T, (f; 3 ) c ) < E P K-i < r 2 T, S^T^r^)) > (jjT 3 + e 9 )r 2 T) 


71=1 


n= 1 


< 


JV r N r / \ 

EP(^(r 2 T)>(^ + e 9 )r 2 T)<EP sup |S 3 r (t) - > e 9 r 2 T 

n=l \0<t<r 2 T , 


71=1 


= WP f sup |S 3 (t) - ^i| > e 9 r 2 T ) < iV : 
\0<t<r 2 T J 


■ C 3 (e9,a 2 ) 
(r^T) i+ Q 2 


0 as r oo, (193) 


where ck 2 > 0 is an arbitrary constant, and (193) is by Lemma 14 and the fact that N r = 0(r 2 ). 
Moreover, for each r, C 3 (eg,a 2 ) is the constant defined in Lemma 14 associated with the renewal 
process S 3 . It is straightforward to see that C 3 (eg,a 2 ) is bounded above by a constant uniformly in 
r, hence we get the convergence result in (193). 

Next, let us consider the second sum in (192). Let us fix n and r. Let 


V ■= inf {t € [0, r 7 T) : | E% n (t) - fi 3 t\ > e 2 r 7 T} , 


where inf{0} := oo for completeness. Then 


p sup | El' n (t) - n r 3 t\ > e 2 r 7 T, £ r n3 = P (rj < r^T, £ r n 3 ) . 

\0<t<r~/T J 

Let V^(-k) : = 0 for all k G N and VJ(k) := E=i v l( B n + *)■ Then > 

m n {t) - > e 2 r^T} = {E 3 n (t) > fit + e 2 r 7 T} U {E^(t) < - e 2 r 7 T} 

C {. l%*(t ) > L+ e 2 r 7 Tj} U {E% n (t) < \fit - 6 2 r 7 T]} 
= + e 2 r 7 Tj) < t} U {V 3 r (\^t - ^T\) > t}. 

Let us define 

m ■= inf {t € [0, r 7 T) : V 3 ([^ r 3 t + e 2 r 7 Tj) < t} , 
rj 2 := inf |t € [0,r 7 T) : V^{\fi r 3 t - e 2 r 7 T]) > t} . 
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Then rj > rji A r/ 2 - Thus 

P {V < r^T, S r nj3 ) < P (771 A r /2 < r 7 T, £^ 3 ) <P{ m < r^T, ^ 3 ) + P (r ? 2 < r 7 T, £ r n;i ) . (194) 

We will consider the two probabilities after the second inequality in (194) separately. First, 

■ * f, _ rn t rm i Trr<\ r, , n Lm^ + ^r 7 Tj + e 2 r 7 rj 

771 = mi a£|0,r r T) : V 3 (|_/i 3 t + e 2 r 'TJ)- 2 --- < t ----- 


M 3 


Ms 


Next, let us define 


• t j*^rn Fr/i r, , n LmS* + ^rTTj e 2 r 7 T 

771 := mf <f € [0, r 7 T) : P 3 ([ /J, 3 t + e 2 r 7 TJ)----- < - 


Ms 


2 m^ 


When r is sufficiently large, e 2 r 7 T > 2. This implies 


e 2 r 7 T ^ + LM^ + e 2 r 7 Tj 


2m 3 M 3 

and P (r/i < r 7 T, ££ 3 ) < P (771 < r 7 T, ££ 3 ) when r is sufficiently large. Then, 

e 2 r 7 T 


P ( 57 ! < r 7 T, £; 3 ) < P 


sup 

,i=l,2,...,L(^+e 2 )rT'Tj 


™ -14 


> 


2 Ms 




< E 


= E 


sup 

,;=i,2,...,L(mS+c2)^tj 


W) - —7 

M 3 


2+2a 




e 2 r 7 T 

2m5 


n,3 


— (2+2q) 


r(/4+6 9 )r 2 Tl 

E 

7=1 


r(M3+e9)f 2 'rl 

E E 

7=1 


sup 

v i=l,2,...,L(M§+e2)r'TTJ 


sup 

0=i,2,...,L&*5+«)rTrj 


™ * 14 


2 + 2 « 


i (K = j ) 


e 2 r 7 T 

^mT 


-(2+2a) 


P3 r (*+i)-P3 r (j)-- 

M 3 


2+2a 


(195) 


W = j) 


e 2 r 7 T 

2M5 


<r(^ + e 9 )r 2 T]E 


sup 

,i=l,2,...,L(MS+e 2 )rT'Tj 


™ -14 


2+2o 


( e 2 r^T \ 

V 2/zJ J 


-(2+2 a) 


— (2+2«) 


(196) 


where a > 0 is an arbitrary constant such that 4/(1+a) < 7 ; the first inequality is by the definition of 
771 (cf. (195)); the second inequality is by Markov’s inequality; the first equality is by the definition of 
£ r n ' 3 (cf. (191)); the second equality is by the fact that (V 3 (j+i) — V 3 (j))I (B^ = j ) = V 3 (i)I (B^ = j ) 

for all i, j £ N+; and the last inequality is due to the fact that I (B^ = j) < 1 and V 3 (j + i) — V 3 ( j ) = 
V[(i) by the i.i.d. property of {v 3 (i),i £ N+}. Note that 


{( W) _+), i£N+ } 

is a martingale. Then, by the L p maximum inequality (cf. Theorem 5.4.3 of Durrett [2010]), we see 
that the term in (196) is less than or equal to 


Km5 + e 9 )r 2 T 1 


/2 + 2 a\ 

Vl + 2 cJ 


(2+2a) 

E 


P 3 r (L(M3 + ^)r 7 Tj) 


L(^ + e 2 )r 7 Tj 

MS 


2+2a 


/ e 2 r 7 T\ _(2+2a) 

WmW 

(197) 
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Then, by using Lemma 15 on (197), the term in (197) is less than or equal to 

' 2 + 2a\ ( 2+2a ) / -" M(2+2a) 


f(^ + e 9 )r T] 


1 + 2 a 






xE uj(l) - ^ 

^3 

so does P (rj\ < r r/ T. S^ 3 ). 

Now, we will consider the second probability in (194). First, 


2+2 a 


/e 2 +rr (2+2Q) 


(198) 


■ ()i ,rn -yrm T+rvr r* 7^11 1^3* “ + |m^ “ e 2r 7 T"| 

r ] 2 = mf t € [0, r 7 T) :V 3 {\ n 3 t - e 2 r 1 T \)-^--- >t- 


+3 




Next, let us define 


• c J^rn 7^3 T^rvr O 7^1N [7*3* ^ «2 \ 

rj 2 := mf t <E [0,r 7 T) : P 3 (f/i 3 t - e 2 r 7 T|)-*—_- > j 


M3 


When r is sufficiently large, e 2 r 7 T > 2. This implies 


e 2 r 7 T < t _ \n 3 t — e 2 r 7 T] 


2/r 3 M 3 

and P (r / 2 < r 7 T, ££ 3 ) < P (rj 2 < r 7 T, ££ 3 ) when r is sufficiently large. Moreover, since M 3 > £2 
when r is sufficiently large, |~(m 3 — e 2 )r 7 T] > 0 when r is sufficiently large. Then, 

+o-4> £2r7T N 


P 1+ < r-T, + ..) < P 


< P 


sup 

.i=l,2,...,l(n r 3 -e 2 )ryT] 


sup 

i=l,2,..., L(£ t 3+ £ 2+ 7 TJ 


/*» 


W) - 7 + 


> 


2 mS ’ 

e 2 r 1 T 


2m7 


P 

? 


n, 3 


(199) 


< 


r/ r \ 2 t-ih /2 + 2a \ (2+2a) f 18(2 + 2a) 3 / 2 \ 

^+7+n(—) ( ( ; +2a)1 / j 


(2+2a) 


x(KM7 + 6 2 )r 7 Tl) 


1 +a 


E 


” 5(1) - k 


2+2 a 


£9r 7 T\ _(2+2a) 

J , ( 200 ) 


2 m; 


where the inequality in (199) is by the fact that e 2 r 7 T > 1 when r is sufficiently large (this implies 
|~(m 3 — e 2 )r 7 T] < [(m 3 + e 2 )r 7 Tj when r is sufficiently large) and (200) is by (196) and (198). 

By the exponential moment assumption (cf. Assumption 2), both of the right hand sides in (198) 
and (200) are O(r 2 ~ 7 ( 1 +C 0). By (194) and the fact that 4/(1 + a) < 7 , we have 

P ( SU P 1^3 > £2 r 7 T , ££ 3 '| 

0<t<r^T / 


n=l 


lV r 


< E (P fa < r 7T ’ ^, 3 ) + P fa < r 7 T, f; 3 )) 

n=l 

= lV r O(r 2 - 7 ( 1+ “)) = 0 ( r 4 - 7 ( 1+ “)) -+ 0 , as r -+ 00 . 
By the fact that = B T n — 1 and using the same technique, we can also prove that 


P ( SU P Ifafa) - /4*| > £2i- 7 P, < r 2 T^j -+ 0, 

^ Vo<t<r7T / 


as r —>• 00 . 
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B.5 Proof of Lemma 4 

Consider the sum in (95), which is less than or equal to 


ivr 


n— 1 


£ p w« s At+<*). simrS)) > ivj) 

+ E p Kt' < >' 2 (P + £4), SJ(t£*) > JVJ) 

iVJ 

+ X] P Kn < r2 ( r + <t), Q r 2 (r 2 n) > ((^2 A fl B - k - A 6 ) A e 5 ) r/2) 


72=1 


72—1 


(201) 

( 202 ) 

(203) 


First, consider the sum in (201), which is less than or equal to 


m 


E P ( T S < r 2 (T + £ 4 ), S' B (r B ( rSD > (AS + u!- B )r 2 (T + £„)) 


n=l 


m 


< 


^ P ( S B (r 2 (T + £ 4 )) > (A^ + fi B )r 2 (T + e 4 )) 


72— 1 

< ^ 2 r P ( sup |S£(t) - ,/ R t\ > X r b r 2 (T + £4) 

\0</<r 2 (T+e 4 ) , 

as r — 00 , 


[r 2 (T + £ 4 )] 


(204) 


where aq > 0 is any arbitrary constant and (204) is by Lemma 14 (cf. Appendix B.4). For each 
r, (A£, aq) is the constant defined in Lemma 14 associated with the renewal process S r B . It is 
straightforward to see that C B (X b , aq) is bounded above by a constant uniformly in r, hence we get 
the convergence result in (204). We can show that the sum in (202) converges to 0 by the same way 
we derive (204). 

Next let us look at the sum in (203), which is less than or equal to 


(( 1*2 * HB - K - \b) A e 5 ) r\ 

Z^P SU P <22W > - 5 - • (205) 

n =1 \° <t<r 2 (T+e 4 ) 1 J 

Let us construct a hypothetical GI/GI/1 queue, where the buffer length process is denoted by Q^\ 
the interarrival and service time sequences are {v b {i){X b /{X^ + /c/ 2 )), i € N + } and {v^i)(^ 2/(^2 ~ /c/ 2 )), i G N + } 
respectively', and := Q 2 {ty = 0- Hence, the arrival and service rates in the hypothetical 

GI/GI/1 queue are Xb + n/2 and /j, 2 — /c/2, respectively. By Assumption 3 Parts 1 and 2, there exists 
an r$ > 1 such that if r > r^, the term in (205) is less than or equal to 


JV 2 

£p 

72— 1 


sup 

\0<i<r 2 (T+e 4 ) 


Q2 1} 


(t)> 


feA/ifi - k - Aft) A £ 5 ) r 


(206) 


The term in (206) converges to 0 as r —> 00 by the fact the corresponding server is in light traffic 
and Proposition 5. 

7 Since t>J(i)(A£/(A& + k/2)) = Vb(i)/(\b + «/2) and ^ 2 ( 1 )(^ 2/(^2 - /c/2)) = V 2 (i)/(fJ .2 - /c/2) for all i € N+ (cf. 
Section 3.1), the corresponding sequences are independent of r 
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B.6 Proof of Lemma 5 

In the optimization problem (114), replacing <74 with w 4 — (ha/Hb)% gives us the following 
equivalent optimization problem which has only one decision variable. 


HA 


Hb 


min h 4 [w 4 - q 5 + h 5 q 5 + h 7 I w A - q :i - -q 5 


HA 


Hb 


+ h 8 (q 3 -W4+ —q 5 

V Hb 

+ hg (q 6 - q 5 ) + + hio (q 5 - 9e)" 


, n ^ ^ HB 

s.t. U < <75 < — W4. 

HA 


(207a) 

(207b) 


The objective function (207a) is the sum of six different functions each of which is convex, continuous, 
and piecewise linear with respect to the decision variable <75 . Since sum of finitely many convex (con¬ 
tinuous, piecewise linear) functions is convex (continuous, piecewise linear), the objective function 
(207a) is also convex, continuous, and piecewise linear with respect to the decision variable q§. Then, 
an optimal solution should be either in the boundaries of the feasible region of <75, which are 0 and 
(hb/ha)w 4 (cf. (207b)), or in one of the break points which are in the interval [0, (hb/ Ha)wY\ of 
the convex, continuous, and piecewise linear objective function (207a), which are (hb / Ha)(w 4 — <? 3 ) + 
and (/g A (hb/ha)w 4 . Therefore, the optimal solution, denoted by q£, is such that 


* _ / n Hb hb , x+ I 

? 5 e 0, — w 4 , —( w 4 - <73r, 96 A — w 4 } . 

{ HA HA HA j 

Then the optimal solution set in Table 3 follows by plugging q£ in the equality constraint (114b). 

B.7 Proof of Lemma 6 

For each j £ {1,2,..., n}, hj (qj — qj,Y) + is a convex and continuous function of the decision 
variables < 7 *, i £ { 1 , 2 ,... , n}. Since sum of finitely many convex and continuous functions is also 
convex and continuous, then the objective function (131a) is also convex and continuous function 
of the decision variables qj, j £ {1,2,... ,n}. Then, a local optimum solution of the optimization 
problem (131) is also a global optimum and we will prove that the solution (132) is a local optimum. 
First, it is easy to see that the solution (132) satisfies the constraints (131b) and (131c), so it is 
feasible. We will prove that the solution (132) is a local optimum by showing that any deviation 
from this solution does not improve the objective function value. 

Let us fix an arbitrary j £ {1, 2,..., n — 1} and consider qj. Note that the cost incurred due 
to < 7 j is 0 (cf. Remark 14). Thus, decreasing qj cannot decrease hj (qj — < 7 j, 2 ) + but may increase 
the objective function value because at least one i £ {1,2,..., n}\{j} will increase by (131b). 
Therefore, decreasing qj, j £ {1, 2,... , n — 1} does not improve the objective function value. 

Next, let us increase qj by e where e > 0 but sufficiently small. Increasing qj by e increases the 
objective function value by at most hje. Since h t (qi — < 7 *, 2 ) + = 0 for all i £ {1,2,..., n — 1}\{j}, 
decreasing qi, i £ {1,2,... ,n — l}\{j} will not decrease the objective function value. Therefore, 
increasing qj but decreasing qi, i £ {1,2,... ,n — 1 }\ {.7} does not improve the objective function 
value. Next, let us decrease q n . There are two cases to consider. First, suppose that qj > q Jt 2 - Then, 
increasing qj by e increases the objective function value exactly by hje. In this case, decreasing q n 
by Hn^/Hj (cf- (131b)) can decrease the objective function value by at most h n Hn^/Hj which is less 
than or equal to hje because hjHj > /i n /i n , thus the net change in the objective function value is 
nonnegative. Second, suppose that qj < < 7 ^ 2 - Then, increasing qj by e will not increase the objective 
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function value for sufficiently small e. Note that, when qj < qj 2 , by (132a) and (132b) 


> Sl = 

Hj fij 


w, 


w — 


gi ,2 

mi 


92,2 

M2 


if 3 = 1, 

- q i^A , ifj€{2,3,...,n-l}. 


(208) 


This implies that if j = 1. q n = 0 by (131b) and (131c), and if j € {2, 3,..., n — 1}, then by (132c) 
and (208), 


Qn — 


— f^n 

~ Mn 
= 0. 




gj,2 

Mi 


, + \ + 

\ 

\ Qj+ 1,2 \ 

Qn- 1,2 \ 

/ Hj+i j 

f^n— 1 J 



gj+l,2 \ + 
Mj+1 J 


Qn—1,2 
Hn—1 


+ 



Qn- 1,2 
Hn—\ 


Hence, when qj < qj^, q n = 0 so it cannot be decreased by (131c). As a result, increasing qj by e does 
not improve the objective function value. Therefore, increasing or decreasing qj does not improve 
the objective function value for all j G { 1 , 2 ,... , n — 1 }. 

Lastly, let us consider q n . If we increase (decrease) q n , then some of the qj, j € {1, 2,..., n — 1} 
must increase (decrease) by (131b) and (131c). Since the latter change does not improve the objective 
function value, changing the value of q n does not improve the objective function value. Therefore, 
the solution (132) is a local optimum and also a global optimum. 

B.8 Proof of Lemma 7 

By decreasing the number of decision variables from 4 to 2 by using the equality constraints and 
then simplifying the notation in the optimization problem (136), we get the following equivalent 
optimization problem: 

min h a (w 5 --^-q 6 - q 4 ) + h b (q 6 V q 7 ) + h c ( ^-(w 6 - q 7 ) - q 9 ) , 

V VBi J V/LB 2 J 

S.t. 0 < (?6 < 0 < q-? < Wq, 

HA 

where the decision variables are q§ and qj. Note that increasing q$ or qy decrease the first and 
third terms but increase the second term in the objective function (209a). Hence, the key point is to 
compare the total decrease in the first and third terms with the increase in the second term in (209a) , 
when we increase q& and q 7 from 0 to their corresponding upper bounds. We solve the optimization 
problem (209) case by case. 

First consider the case q 4 > W 5 . Then, the first term in the objective function (209a) is 0 for 
all values of qe satisfying the constraints in (209b). Since increasing q$ increases the second term 
in (209a), q$ = 0 in the optimal solution. Since hb > h c ^c/hB 2 by assumption, then increasing 
q~i does not decrease the objective function value, thus q-j = 0 in the optimal solution. In the 
case qg > {hc/HB 2 )'uoq, qg = q-j = 0 in the optimal solution by the same logic and the fact that 
hb > h a HA/H bi- Therefore, the last case to consider is > q 4 and (hc / HB 2 )wq > qg and we will 
consider it in two cases. 


(209a) 

(209b) 
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Case 1 : hb > h a ^A/hBi + h c pic / hB 2 By the second term in (209a), it is more efficient to 
increase q$ and qj together with the same rate (if possible). Suppose that q§ = qj = 0. If we 
increase qe and q? by sufficiently small eio > 0 , then the objective function value increases by 
(hb — h a HA/ hB\ — h c iic/> 0. Therefore, we should not increase q$ and <77 at all, and 
% = Q7 = 0 in the optimal solution. Lastly, (137) follows by the two equality constraints of the 
optimization problem (136). 

Case 2: hb < ^a/d/Msi + hchc / hB 2 Suppose that qe = q^ = 0. If we increase q§ and 
q^ by sufficiently small eio > 0 , then the objective function value changes by (hb — h a ^A/hBi — 
hchC / hB 2 )tio < 0. Hence, it is efficient to increase q& and q-j together until either qe = (ubi/ hA){u >5 — 
54 ) or <77 = wq — {^B 2 /hc)<l 9 - After this point, if we increase at least one of the qe or < 77 , then the 
objective function value increases because first or third term in (209a) is in its lower bound (which is 
0) at this point and hb > h a [iA/hB\ and hb > h c nc/hB 2 ■ Therefore, q& = qj = min{( 7 x 51 ///^) (105 — 
(74), wq — (hB 2 /hc)Q 9 }- Then, (138) follows by the two equality constraints of the optimization 
problem (136). 

Remark 16 The solution (138) can be explained intuitively in the following way. Note that, we 
start with the solution Q§ = Q 7 = 0 in the proof of (138), which is equivalent to giving full priority 
to type b jobs in servers 5 and 6 . Then we increase Qq and Q 7 , which is equivalent to giving some of 
the priority to type a and c jobs. We do this until the point in which giving more priority to type a 
and b jobs does not increase the throughput rate of at least one of these job types because more than 
necessary jobs accumulates in buffers 11 or Ij. Therefore, we stop giving priority to type a and b 
jobs at this point. 

C Detailed Simulation Results 

In this section, we present the detailed results of the simulation experiments. 
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Ins. 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 


Table 4: Detailed results of the simulation experiments: Average queue lengths with their 95% confidence intervals. 


Proposed Policy 

SDP Policy 

Static Priority Policy 

FCFS Policy 

Randomized Policy 

Randomized-2/3 Policy 

Q£(0 + Qf(i) 

Qe(i) + Qio(i) 

Qs(i) + Qr(i) 

QeW + QloW 

Qs(i) + Qr(i ) 

%W + Qlo (0 

Qs(i) + Qr(i) 

Qe(i) + Qr »W 

Qs(i) + Qr(i) 

Qe(i) + Qio(i ) 

Qs(i) + Q,(i ) 

Qe(i) + Qio(i ) 

14.01 ± 0.16 

14.43 ± 0.17 

14.01 ± 0.16 

14.43 ± 0.17 

13.6 ± 0.13 

18.21 ± 0.20 

14.72 ± 0.14 

14.75 ± 0.17 

14.51 ± 0.14 

14.72 ± 0.17 

13.59 ± 0.15 

17.31 ± 0.14 

21.91 ± 0.32 

23.9 ± 0.27 

21.91 ± 0.32 

23.9 ± 0.27 

21.55 ± 0.32 

29.58 ± 0.36 

23.36 ± 0.24 

23.33 ± 0.18 

23.75 ± 0.37 

23.14 ± 0.22 

21.49 ± 0.27 

28.09 ± 0.30 

45.73 ± 0.84 

52.44 ± 0.82 

45.73 ± 0.84 

52.44 ± 0.82 

44.8 ± 0.88 

62.75 ± 0.89 

50.01 ± 0.69 

49.86 ± 0.70 

50.02 ± 0.71 

49.66 ± 0.75 

44.91 ± 0.79 

60.59 ± 0.92 

14.09 ± 0.19 

6.53 ± 0.06 

14.09 ± 0.19 

6.53 ± 0.06 

13.67 ± 0.19 

13.28 ± 0.11 

14.7 ± 0.17 

7.56 ± 0.05 

14.49 ± 0.13 

7.71 ± 0.06 

13.6 ± 0.15 

12.12 ± 0.14 

21.86 ± 0.22 

11.63 ± 0.16 

21.86 ± 0.22 

11.63 ± 0.16 

21.52 ± 0.27 

21.35 ± 0.24 

23.43 ± 0.28 

12.17 ± 0.11 

23.62 ± 0.24 

12.18 ± 0.10 

21.79 ± 0.29 

19.53 ± 0.20 

45.84 ± 0.75 

26.69 ± 0.54 

45.84 ± 0.75 

26.69 ± 0.54 

44.9 ± 0.84 

44.9 ± 0.49 

49.7 ± 0.58 

25.78 ± 0.26 

50.57 ± 0.96 

25.57 ± 0.31 

44.31 ± 0.74 

41.86 ± 0.64 

3.67 ± 0.00 

17.27 ± 0.14 

3.67 ± 0.00 

17.27 ± 0.14 

3.03 ± 0.00 

18.07 ± 0.14 

7.59 ± 0.05 

14.69 ± 0.15 

7.75 ± 0.07 

14.73 ± 0.22 

3.64 ± 0.01 

17.32 ± 0.16 

5.97 ± 0.01 

28.03 ± 0.35 

5.97 ± 0.01 

28.03 ± 0.35 

5.12 ± 0.01 

29.11 ± 0.33 

12.26 ± 0.11 

23.64 ± 0.24 

12.37 ± 0.13 

23.58 ± 0.28 

5.9 ± 0.01 

27.95 ± 0.27 

12.59 ± 0.04 

60.99 ± 1.00 

12.59 ± 0.04 

60.99 ± 1.00 

10.99 ± 0.03 

62.74 ± 0.79 

25.73 ± 0.32 

50.75 ± 0.86 

25.58 ± 0.33 

50.15 ± 0.74 

12.42 ± 0.04 

60.49 ± 0.79 

3.67 ± 0.00 

11.95 ± 0.12 

3.67 ± 0.00 

11.95 ± 0.12 

3.03 ± 0.00 

13.25 ± 0.11 

7.57 ± 0.06 

7.55 ± 0.06 

7.69 ± 0.06 

7.75 ± 0.05 

3.64 ± 0.01 

12.23 ± 0.14 

5.98 ± 0.01 

19.13 ± 0.16 

5.98 ± 0.01 

19.13 ± 0.16 

5.12 ± 0.01 

21.55 ± 0.20 

12.15 ± 0.10 

12.13 ± 0.10 

12.24 ± 0.10 

12.22 ± 0.10 

5.9 ± 0.01 

19.43 ± 0.25 

12.58 ± 0.03 

41.4 ± 0.54 

12.58 ± 0.03 

41.4 ± 0.54 

10.99 ± 0.03 

45.94 ± 0.53 

25.4 ± 0.29 

25.33 ± 0.29 

25.84 ± 0.39 

25.59 ± 0.40 

12.44 ± 0.04 

41.74 ± 0.48 

2.4 ± 0.00 

17.95 ± 0.13 

2.73 ± 0.00 

17.83 ± 0.18 

2.4 ± 0.00 

17.95 ± 0.13 

7.49 ± 0.06 

14.8 ± 0.15 

7.55 ± 0.06 

14.58 ± 0.16 

3.32 ± 0.01 

17.4 ± 0.18 

3.79 ± 0.01 

29.18 ± 0.26 

4.19 ± 0.01 

29.06 ± 0.27 

3.79 ± 0.01 

29.18 ± 0.26 

11.78 ± 0.10 

23.62 ± 0.32 

11.92 ± 0.12 

23.79 ± 0.31 

5.09 ± 0.01 

28.11 ± 0.19 

7.76 ± 0.02 

62.26 ± 0.84 

8.4 ± 0.02 

62.04 ± 0.89 

7.76 ± 0.02 

62.26 ± 0.84 

24.96 ± 0.28 

49.49 ± 0.67 

24.91 ± 0.28 

50.05 ± 0.66 

10.15 ± 0.03 

60.9 ± 0.91 

2.41 ± 0.00 

13.34 ± 0.14 

2.73 ± 0.00 

12.95 ± 0.11 

2.41 ± 0.00 

13.34 ± 0.14 

7.45 ± 0.05 

7.56 ± 0.05 

7.58 ± 0.05 

7.66 ± 0.05 

3.32 ± 0.01 

12.21 ± 0.11 

3.8 ± 0.01 

21.44 ± 0.22 

4.19 ± 0.01 

20.91 ± 0.23 

3.8 ± 0.01 

21.44 ± 0.22 

11.76 ± 0.11 

12.05 ± 0.11 

11.94 ± 0.10 

12.23 ± 0.11 

5.08 ± 0.01 

19.4 ± 0.19 

7.76 ± 0.02 

45.28 ± 0.63 

8.41 ± 0.02 

45.26 ± 0.61 

7.76 ± 0.02 

45.28 ± 0.63 

25.08 ± 0.30 

25.83 ± 0.29 

24.78 ± 0.32 

25.62 ± 0.32 

10.16 ± 0.02 

41.69 ± 0.48 

38.22 ± 0.46 

24.1 ± 0.30 

38.22 ± 0.46 

24.1 ± 0.30 

38.45 ± 0.45 

29.22 ± 0.37 

40.1 ± 0.44 

23.77 ± 0.29 

40.02 ± 0.49 

23.56 ± 0.30 

38.26 ± 0.41 

28.41 ± 0.33 

22.7 ± 0.21 

28.23 ± 0.31 

22.7 ± 0.21 

28.23 ± 0.31 

21.7 ± 0.28 

29.38 ± 0.31 

28.63 ± 0.33 

23.27 ± 0.29 

29.38 ± 0.38 

23.77 ± 0.30 

22.46 ± 0.25 

28.03 ± 0.27 

20.34 ± 0.24 

29.25 ± 0.28 

20.86 ± 0.27 

28.94 ± 0.33 

20.34 ± 0.24 

29.25 ± 0.28 

28.44 ± 0.31 

23.74 ± 0.30 

29.45 ± 0.41 

23.38 ± 0.29 

21.71 ± 0.36 

28.26 ± 0.36 

38.69 ± 0.41 

11.79 ± 0.17 

38.69 ± 0.41 

11.79 ± 0.17 

37.7 ± 0.52 

21.36 ± 0.21 

40.21 ± 0.57 

12.13 ± 0.10 

39.71 ± 0.55 

12.27 ± 0.12 

37.54 ± 0.50 

19.66 ± 0.17 

22.53 ± 0.32 

19.18 ± 0.18 

22.53 ± 0.32 

19.18 ± 0.18 

21.72 ± 0.25 

21.24 ± 0.25 

28.77 ± 0.29 

12.12 ± 0.09 

29.32 ± 0.33 

12.25 ± 0.14 

22.64 ± 0.25 

19.66 ± 0.20 

20.52 ± 0.35 

21.53 ± 0.23 

20.75 ± 0.39 

20.71 ± 0.22 

20.52 ± 0.35 

21.53 ± 0.23 

28.43 ± 0.38 

12.19 ± 0.10 

28.63 ± 0.30 

12.19 ± 0.10 

21.76 ± 0.30 

19.54 ± 0.21 

38.45 ± 0.47 

41.92 ± 0.44 

38.45 ± 0.47 

41.92 ± 0.44 

38.03 ± 0.56 

49.34 ± 0.49 

39.97 ± 0.46 

39.95 ± 0.49 

40.03 ± 0.42 

40.24 ± 0.41 

38.01 ± 0.44 

47.26 ± 0.42 

22.54 ± 0.28 

47.98 ± 0.45 

22.54 ± 0.28 

47.98 ± 0.45 

21.58 ± 0.26 

48.6 ± 0.52 

28.67 ± 0.28 

40.13 ± 0.51 

28.94 ± 0.38 

40.05 ± 0.50 

22.56 ± 0.33 

47.45 ± 0.48 

20.42 ± 0.22 

49.32 ± 0.47 

21.15 ± 0.33 

48.82 ± 0.49 

20.42 ± 0.22 

49.32 ± 0.47 

28.79 ± 0.30 

39.97 ± 0.50 

29.09 ± 0.32 

40.1 ± 0.43 

21.72 ± 0.34 

47.28 ± 0.56 

38.42 ± 0.46 

31.2 ± 0.40 

38.42 ± 0.46 

31.2 ± 0.40 

37.97 ± 0.44 

43.79 ± 0.50 

40.09 ± 0.42 

28.64 ± 0.40 

39.99 ± 0.40 

29.1 ± 0.28 

38.25 ± 0.51 

41.24 ± 0.56 

22.37 ± 0.30 

41.59 ± 0.54 

22.37 ± 0.30 

41.59 ± 0.54 

21.52 ± 0.31 

44.12 ± 0.48 

28.75 ± 0.35 

28.51 ± 0.24 

29.22 ± 0.38 

29.25 ± 0.30 

22.38 ± 0.25 

41.05 ± 0.51 

20.3 ± 0.27 

43.93 ± 0.53 

20.81 ± 0.33 

43.96 ± 0.58 

20.3 ± 0.27 

43.93 ± 0.53 

28.5 ± 0.35 

28.74 ± 0.27 

29.34 ± 0.28 

29.48 ± 0.27 

21.61 ± 0.26 

41.1 ± 0.52 

21.88 ± 0.32 

41.75 ± 0.61 

21.88 ± 0.32 

41.75 ± 0.61 

21.39 ± 0.25 

48.81 ± 0.43 

23.6 ± 0.21 

40.01 ± 0.45 

23.45 ± 0.32 

40.24 ± 0.37 

21.52 ± 0.31 

47.52 ± 0.49 

5.97 ± 0.01 

47.48 ± 0.61 

5.97 ± 0.01 

47.48 ± 0.61 

5.11 ± 0.01 

49.18 ± 0.51 

12.33 ± 0.10 

40.12 ± 0.44 

12.26 ± 0.10 

40.05 ± 0.44 

5.91 ± 0.01 

47.13 ± 0.34 

3.79 ± 0.00 

48.75 ± 0.43 

4.19 ± 0.01 

48.52 ± 0.50 

3.79 ± 0.00 

48.75 ± 0.43 

11.8 ± 0.07 

39.66 ± 0.49 

11.96 ± 0.12 

39.93 ± 0.40 

5.1 ± 0.01 

46.74 ± 0.46 

22.1 ± 0.33 

30.72 ± 0.46 

22.1 ± 0.33 

30.72 ± 0.46 

21.39 ± 0.29 

43.98 ± 0.37 

23.4 ± 0.26 

28.67 ± 0.31 

23.37 ± 0.30 

29.26 ± 0.28 

21.64 ± 0.30 

41.25 ± 0.41 

5.97 ± 0.01 

41.51 ± 0.45 

5.97 ± 0.01 

41.51 ± 0.45 

5.11 ± 0.01 

43.48 ± 0.44 

12.18 ± 0.08 

28.72 ± 0.25 

12.26 ± 0.12 

29.25 ± 0.31 

5.91 ± 0.01 

41.22 ± 0.43 

3.79 ± 0.01 

44.01 ± 0.52 

4.19 ± 0.01 

43.22 ± 0.45 

3.79 ± 0.01 

44.01 ± 0.52 

11.88 ± 0.12 

28.96 ± 0.26 

11.89 ± 0.12 

29.03 ± 0.38 

5.09 ± 0.01 

40.94 ± 0.46 
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